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These are some class notes distributed in a multivariable calculus course tought in 
Spring 2004. This was a physics flavored section. 

Some of the pages were developed as complements to the text and lectures in the years 
2000-2004. While some of the pages are proofread pretty well over the years, others 
were written just the night before class. 

The last lecture was "calculus beyond calculus". Glued with it after that are some 
notes from "last hours" from previous semesters. 



Oliver Knill. 5/7/2004 



Lecture 1: VECTORS DOT PRODUCT 



O. Knill, Math21a 



HOMEWORK: Section 10.1: 42,60: Section 10.2: 4,16 



VECTORS. Two points Pi = (xi,yi), Q = P2 = (#2? 2/2) m the plane determine a vector v = (#2 —^1,2/2 — Hi)- 
It points from Pi to P2 and we can write Pi + v = P2- 

COORDINATES. Points P in space are in one to one correspondence to vectors pointing from to P. The 
numbers Vi in a vector v = (^1,^2) are also called components or of the vector. 

REMARKS: vectors can be drawn everywhere in the plane. If a vector starts at 0, then the vector v = (vi, 1*2) 
points to the point (v\,V2). That's is why one can identify points P = (a,b) with vectors v = (a,b). Two 
vectors which can be translated into each other are considered equal. In three dimensions, vectors have three 
components. In some Encyclopedias like Encylopedia Britannica define vectors as objects which have "both 
magnitude and direction". This is unprecise and strictly speaking incorrect because the zero vector is also a 
vector but has no direction. 



ADDITION SUBTRACTION, SCALAR MULTIPLICATION. 




U + V = (u±, U2) + (Vi, V2) U — V = (ui,U2) — (Vi, V2) \u = \(ui,U2) 

= (ui + vi, u 2 + v 2 ) = (ui - vi, u 2 - v 2 ) = (Xui,Xu 2 ) 



BASIS VECTORS. The vectors i = (1, 0), j = (0, 1) are called standard basis vectors in the plane. In space, 
one has the basis vectors i = (1, 0, 0), j — (0, 1,0), k = (0, 0, 1). 

Every vector v = (^1,^2) in the plane can be written as a sum of standard basis vectors: v = v\i + V2j- Every 
vector v = (vi,V2,Vs) in space can be written as v = v\i + V23 + v^k. 

WHERE DO VECTORS OCCUR? Here are some examples: 



Velocity: if (f(t),g(t)) 
is a point in the plane 
which depends on time t, 
then v = (f'(t),g'(t)) is 
the velocity vector at 
the point (f(t),g(t)). 



Forces: Some prob- 
lems in statics in- 
volve the determina- 
tion of a forces acting 
on objects. Forces 
are represented as 
vectors 



Fields: fields like elec- 
tromagnetic or gravita- 
tional fields or velocity 
fields in fluids are de- 
scribed with vectors. 




Qbits: in quantum 
computation, one 
does not work with 
bits, but with qbits, 
which are vectors. 



K 



i 



Color can be written as 
a vector v = (r,g,b), 
where r is red, g is green 
and b is blue. An other 
coordinate system is v = 
(c, m,y) = (1 — r, 1 — 
<7, 1 — 6), where c is cyan, 
m is magenta and y is 
yellow. 




SVG. Scalable 
Vector Graphics is 
an emerging stan- 
dard for the web 
for describing two- 
dimensional graphics 
in XML. 




VECTOR OPERATIONS: The ad- u + v = v + u commutativity 

dition and scalar multiplication of u + (v + w) = (u + v) + w additive associativity 

vectors satisfy "obvious" properties. w + = + w = null vector 

There is no need to memorize them. r * (s * v) = (r * s) * v scalar associativity 

We write * here for multiplication (r + s)v = v(r + s) distributivity in scalar 

with a scalar but usually, the multi- r(v + w)=rv + rw distributivity in vector 

plication sign is left out. 1 * v = v the one element 



LENGTH. The length \v\ of v is the distance from the beginning to the end of the vector. 
EXAMPLES. 1) If v = (3, 4) , then \v\ = V^E = 5. 2) \i\ = \J\ = k\ = 1, |0| = 0. 



UNIT VECTOR. A vector of length 1 is called a unit vector. If v ^ 0, then v/\v\ is a unit vector. 
EXAMPLE: If v = (3, 4), then v = (2/5, 3/5) is a unit vector, k are unit vectors. 



PARALLEL VECTORS. Two vectors v and w are called parallel, if v = rw with some constant r. 



DOT PRODUCT. The dot product of two vectors v = (i>i,i>2, ^3) and w = (wi, 11)2,11)3) is defined as 



V ■ W = V\W\ + V2W2 + V3W3 



Remark: in science, other notations are used: v- w = (v, w) (mathematics) < v\w > (quantum mechanics) ViW 1 (Einstein 
notation) gijV t w 3 (general relativity). The dot product is also called scalar product, or inner product. 



LENGTH. Using the dot product one can express the length of v as \v\ = \Jv ■ v. 
CHALLENGE. Express the dot product in terms of the length alone. 
SOLUTION: (v + w, v + w) = (v, v) + (w, w) + 2(v, w) can be solved for (v, w). 



ANGLE. Because \v — w\ 2 = [v — w, v — w) = \v\ 2 + \w\ 2 — 2(v, w) is by the 
cos-theorem equal to \v\ 2 + \w\ 2 — 2\v\ ■ \w\ cos(a), where a is the angle 
between the vectors v and w, we get the important formula 



\v ■ w — \v\ ■ \w\ cos(a) 



CAUCHY-SCHWARZ INEQUALITY: \v ■ w\ < \v\\w\ follows from that formula because | cos(a)| < 1. 
TRIANGLE INEQUALITY: \u + v\ < \u\ + \v\ follows from \u + v\ 2 = (u + v) ■ (u + v) = u 2 + v 2 + 2u ■ v < 
u 2 + v 2 + 2\u ■ v\ < u 2 + v 2 + 2\u\ ■ \v\ = (\u\ + \v\) 2 . 



FINDING ANGLES BETWEEN VECTORS. Find the angle between the vectors (1,4,3) and (-1,2,3). 
ANSWER: cos(a) = 16/(y/26VU) - 0.839. So that a = arccos(0.839..) - 33°. 



ORTHOGONAL VECTORS. Two vectors are called orthogonal ]fvw = Q. The zero vector is orthogonal 
to any vector. EXAMPLE: v = (2,3) is orthogonal to w = (-3, 2). 



PROJECTION. The vector a = proj^(v) = w(v ■ w/\w\ 2 ) is called the 
projection of v onto w. 

(Its ab- 



comp^(-) = (v ■ w)/\w\ 



The scalar projection is defined as 
solute value is the length of the projection of v onto w.) The vector 
b = v — a is called the component of v orthogonal to the w;-direction. 




EXAMPLE. v= (0,-1,1), w = (1,-1,0), proj^(v) = (1/2,-1/2,0), comply) = l/y/2. 



Lecture 2: CROSS PRODUCT 



Math21a, O. Knill 



HOMEWORK: section 10.2: 28,42, section 10.3: 12,20 



CROSS PRODUCT. The cross product of two vectors 
v = (vi,V2,vs) and w = (wi,W2,ws) is defined as the vector 

V X W = (V2W3 — V3W2,VsWi — ViU)3,ViU)2 ~ V2W1). 



To compute it: 
multiply diagonally 
at the crosses. 



Vi 


v 2 


v 3 


Vl 


v 2 






X 


X 


X 


w 1 


w 2 


w 3 


Wi 


w 2 



DIRECTION OF v x w: | v x w is orthogonal to v and orthogonal to w. 
Proof. Check that v ■ (v x w) = 0. 



LENGTH: \v x w\ = \v\ \w\ sm(a) 



Proof. The identity | vxw | 2 = \v\ 2 \w\ 2 — (v-w) 2 can be proven by direct computation. Now, \v-w\ = \v\ \w\ cos (a). 
AREA. The length \v x w\ is the area of the parallelogram spanned by v and w. 

Proof. Because \w\ sin(a) is the height of the parallelogram with base length \v\, the area is \v\\w\ sin(a) which 
is by the above formula equal to \v x w\. 

EXAMPLE. If v = (a, 0,0) and w = (6cos(a), 6sin(a), 0), then v x w = (0, 0, abs'm(a)) which has length 
\ab sin(a)|. 

ZERO CROSS PRODUCT. We see that v x w is zero if v and w are parallel. 



ORIENTATION. The vectors v,w and vxw form a right handed 
coordinate system. The right hand rule is: put the first vector v on 
the thumb, the second vector w on the pointing finger and the third 
vector v x w on the third middle finger. 

EXAMPLE. i,j,i x j = k forms a right handed coordinate system. 




DOT PRODUCT (is a scalar) 

v ■ w = w ■ v commutative 

\v ■ w\ = \v\\w\ cos(ai) angle 

(av) ■ w = a(v ■ w) linearity 

(u + v)-w = u- w + v- w distributivity 

{1, 2, 3}. {3, 4, 5} in Mathematica 



CROSS PRODUCT (is a vector) 

v x w = — w x v anti-commutative 

\v x w\ = \v\\w\ sin(a) angle 

(av) x w = a(v x w) linearity 

(u + v) xw = uxw + vxw distributivity 

Cross[{l, 2, 3}, {3, 4, 5}] in Mathematica 

^(v x w) = v x w + v x w product rule 



TRIPLE SCALAR PRODUCT. The scalar f 



= u ■ (v x w) is called the triple scalar product of u, v, w. 



PARALLELEPIPED, [u, u, w] is the volume of the parallelepiped 
spanned by u, v, w because h = u ■ n/\n\ is the height of the par- 
allelepiped if n = (v x w) is a normal vector to the ground par- 
allelogram which has area A — \n\ = \v x w\. The volume of the 
parallelepiped is hA = u ■ n\n\/\n\ = \u ■ (v x w)\. 




EXAMPLE. Find the volume of the parallel epiped which has the one corner O = (1, 1,0) and three corners 
P = (2, 3, 1), Q = (4, 3, 1), R = (1, 4, 1) connected to it. 

ANSWER: The parallelepiped is spanned by u = (1,2,1), v = (3,2,1), and w = (0,3,2). We get v x w 
(1, —6, 9) and u ■ (v x w) = —2. The volume is 2. 



DISTANCE POINT-LINE (3D). If P is a point in space and L is the line which 
contains the vector u, then 


P 




d(P,L) = \PQ x u\/\u\ 










is the distance between P and the line L. 







PLANE THROUGH 3 POINTS P, Q, R: 

The vector ft = PQ x PR is orthogonal to the plane. We will see next week that n = (a, 6, c) defines the plane 
ax + by + cz = d, with d = axo + byo + czq which passes through the points P = (xq, yo, zq), Q, R. 



The cross product appears in many different applications: 

ANGULAR MOMENTUM. If a mass point of mass m moves along a curve f(t), then the vector L(t) = 
mr(t) x f'(t) is called the angular momentum of the point. It is coordinate system dependent. 



ANGULAR MOMENTUM CONSERVATION. 

^-L(t) = mf y (t) x r*(£) + mf(t) x r"(t) = f(t) x F(t) 
In a central field, where F(t) is parallel to r(£), we get d/dtL(t) = which means L(t) is constant. 



TORQUE. The quantity r(t) x F(t) is also called the torque. The time derivative of the momentum p = mr' 
is the force F. the time derivative of the angular momentum L = f(t) x p(t) = mr(t) x f(t) is the torque. 



KEPLER'S AREA LAW. (Proof bv Newton) r , t , 


The fact that L(t) is constant means first of all 
that r(t) stays in a plane spanned by r(0) and 
r*(0). The experimental fact that the vector r(t) 
sweeps over equal areas in equal times ex- 
presses angular momentum conservation: \f(t) x 
f(t)dt/2\ = \Ldt/m/2\ is the area of a small 
triangle. The vector r(t) sweeps over an area 
J T \L\dt/(2m) = \L\T/(2m) in time [0,T]. 









MORE PLACES IN PHYSICS WHERE THE CROSS PRODUCT OCCURS: 



The top, the motion of a rigid body is describe by the angular momentum L and the angular velocity vector 
in the body. Then L = L x Q + M, where M is an external torque obtained by external forces. 



Electromagnetism: (informal) a particle moving along r(t) in a magnetic field 

B for example experiences the force F(t) = qf'(t) x B, where q is the charge of 
the particle. In a constant magnetic field, the particles move on circles: if m is 
the mass of the particle, then mf"(t) = qf'(t) x B implies mf'(t) = qf (t) x B. 
Now d/dt\f\ 2 = 2f ■ f ' = f ■ qf (t) x B = so that \f\ is constant. 



Hurricanes are powerful storms with wind velocities of 74 miles per hour or more. 
On the northern hemisphere, hurricanes turn counterclockwise, on the southern 
hemisphere clockwise. This is a feature of all low pressure systems and can be 
explained by the Coriolis force. In a rotating coordinate system a particle of mass 
m moving along f(t) experience the following forces: muj' x f (inertia of rotation), 
2muj x f 1 (Coriolis force) and mcu x (uj x f)) (Centrifugal force). The Coriolis force 
is also responsible for the circulation in Jupiter's Red Spot. 




Lecture 3. DISTANCES 



Math21a, O. Knill 



DISTANCE POINT-POINT (3D). If P and Q are two points, then 

d(P,Q) = \PQ\ 
is the distance between P and Q. 



DISTANCE POINT-PLANE (3D). If P is a point in space and E : ft ■ x = d is 
a plane containing a point Q : then 

d(P,Z) = \(PQ)-n\/\n\ 

is the distance between P and the plane. 




DISTANCE POINT- LINE (3D). If P is a point in space and L is the line 
r(t) = Q + tu, then 

d(P,L) = \(P~Q) x u\/\u\ 
is the distance between P and the line L. 




DISTANCE LINE-LINE (3D). L is the line r(t) = Q + tu and M is the line 
s(t) = P + tv, then 



d(L, M) = \{PQ) ■ {uxv)\/\uxv\ 
is the distance between the two lines L and M. 




DISTANCE PLANE-PLANE (3D). If n ■ x = d and n ■ x = e are two parallel 
planes, then their distance is (e — d)/\n\. Nonparallel planes have distance 0. 




EXAMPLES 



DISTANCE POINT-POINT (3D). P = (-5,2,4) and Q = (-2,2,0) are two 
points, then 



d(P,Q) = \PQ\ = v / (-5 + 2) 2 + (2-2) 2 + (0-4) 2 = 5 



DISTANCE POINT-PLANE (3D). P = (7, 1, 4) is a point and E : 2x+±y+bz = 
9 is a plane which contains the point Q = (0, 1, 1). Then 

d(P,E) = |(7,0,3) • (2,4,5)|/|V45| = 29/V45 

is the distance between P and E. 




DISTANCE POINT-LINE (3D). P = (2,3, 1) is a point in space and L is the 
line f(t) = (1,1,2) + £(5, 0, 1). Then 

d(P,L) = |(1,2,-1) x (5, 0, l)|/v / 26 = |(2, -6, -10)|/V26 = \/l40/\/26 

is the distance between P and L. 




DISTANCE LINE-LINE (3D). L is the line r{t) = (2,1,4) + £(-1,1,0) and 
M is the line s{t) = (-1, 0, 2) + t(5, 1,2). The cross product of (-1,1,0) and 
(5, 1, 2) is (2, 2, —6). The distance between these two lines is 

d(L, M) = |(3, 1, 2) • (2, 2, -6)|/V44 = 4/V44 . 



DISTANCE PLANE-PLANE (3D). 5x + 4y + 3z = 8 and 5x + 4y + 3z = 1 are 
two parallel planes. Their distance is 7/VEO. 



LINES and PLANES 



Math 21a, O. Knill 



HOMEWORK: Section 10.3: 18,42, section 10.3: 30,32. 



PROBLEM SESSIONS: Tuesdays 3-4pm SC 411 



LINES. A point P and a vector v define a line L. It is the set of points 



\L = {P + tv, where t is a real number } 



The line contains the point P and points into the direction v. 

EXAMPLE. L = {(x, y, z) = (1, 1, 2) + t(2, 4, 6) }. 

This description is called the parametric equation for the line. 



EQUATIONS OF LINE. We can write (x,y,z) = (1, 1, 2) + t(2, 4, 6) so that x = 1 + 2t, y = 1 + 4£, z = 2 + 6*. If 
we solve the first equation for t and plug it into the other equations, we get y = 1 + {2x — 2), z = 2 + 3(2x — 2). 
We can therefore describe the line also as 

L = {(x, y,z) | y = 2x — 1, z = Qx — 4 } 

SYMMET RIC EQUATION. The line r = P + tv with P = (ar , yo, zo) and u = (a, 6, c) satisfies the symmetric 
equations x ~^° = y ~ l J J0 = z ~ c z ° (every expression is equal to t). 



PROBLEM. Find the equations for the line through the points P = (0, 1, 1) and Q = (2, 3, 4). 

SOLUTION. The parametric equations are (x,y,z) = (0,1,1) +£(2,2,3) or x = 2t,y = 1 + 2t,z = 1 + 3t. 
Solving each equation for t gives the symmetric equations x/2 = (y — l)/2 = (z — l)/3. 



PLANES. A point P and two vectors v, w define a plane E. It is the set of points 



| E = {P + tv + su>, where t, 5 are real numbers } | 
The line contains the point P. 

EXAMPLE. £ = {(#, y, «) | (1, 1, 2) + t(2, 4, 6) + s(l, 0, -1) }. 
This is called the parametric description of a plane. 




EQUATION OF PLANE. Given a plane as a parametric equation P = Q + tv + sw. The vector ft = v x w is 
orthogonal to both n and w. Because also the vector PQ = Q — P is perpendicular to n we have (Q — P) • n — 0. 
With Q = (xo, yo, zo), P = (x, y, z), and ft = (a, 6, c), this means ax + by + cz = axobyo + czq = d. The plane is 
therefore described by a single equation \ax + by + cz = d\. 



PROBLEM. Find the equation of a plane which contains the three points P = (—1, —1, 1),( 
(1,1,3). 



= (0,1, 1), R = 



SOLUTION. The plane contains the two vectors v = (1,2,0) and w = (2,2,2). We ha ve n = (4, -2, -2) an d 
the equation is 4x — 2y — 2z = d. The constant d is obtained by plugging in one point: | 4x — 2y — 2z = — "4~j . 

LINES AND P LANES IN MATHEMATICA. 

Plotting a line: I ParametricPlot3D[{l, 1, 1} + t{3, 4, 5}, {t, -2, 2}] I 



Plotting a plane: ParametricPlot3D[{l, 1,1} + t{3, 4, 5} + s{l, 2, 3}, {t, -2, 2}, {s, -2, 2}] 



Finding the equation of a plane 



P = {-1,-1,1};Q= {0,1,1}; R= {1,1,3}; n= Cross[Q-P,R -P];n.{x,y,z} - 



DISTANCE POINT-PLANE (3D). If P is a point in space and n ■ x = d is a 
plane containing a point Q, then 

d(P,L) = \(P-Q)-n\/\n\ 

is the distance between P and the plane. 

You recognize that this is just the scalar projection of the vector QP = P — Q 
onto the vector n. 



DISTANCE POINT- LINE (3D). If P is a point in space and L is the line 
r(t) = Q + tu, then 

d(P,L) = \(P-Q) x u\/\u\ 
is the distance between P and the line L. 

This formula is verified by writing (P — Q) x u = \P — Q\\u\ sin(0). 



DISTANCE LINE-LINE (3D). L is the line f(t) = Q + tu and M is the line 
s(t) = P + tv, then 

d(L, M) = \{P -Q)-(ux v)\/\u x v\ 
is the distance between the two lines L and M. 



This formula is just the scalar projection of QP = P — Q onto the vector 
n = u x v normal to both u and v. 



PLANE THROUGH 3 POINTS P, Q, R: The vector (a, 6, c) = n = (Q - P) x (R - P) is normal to the plane. 
Therefore, the equation is ax + by + cz = d. The constant is d = axo + byo + czo because the point P = (xo,yo, zq) 
is on the plane. 



PLANE THROUGH POINT P AND LINE f(t) = Q + tu. The vector (a, b, c) = n = u x (Q - P) is normal to 
the plane. Therefore the plane is given by ax + by + cz = d, where d = clxq + byo + czq and P = (xq, yo, zq). 



LINE ORTHOGONAL TO PLANE ax+by+cz=d THROUGH POINT P. The vector n = (a, 6, c) is normal to 
the plane. The line is f(t) = P + nt. 



ANGLE BETWEEN PLANES. The angle between the two planes axx + biy + ciz = d 1 and a 2 x + b 2 y + c 2 z = d 2 
is arccos(ni • n 2 /{\n\ \ \n 2 \), where Hi = (ai,bi,C{). Alternatively, it is arcsin(|ni x n 2 \/(\fii \ \n 2 \)). 



INTERSECTION BETWEEN TWO PLANES. Find the line which is the intersection of two non-parallel 
planes a\x + b\y + c\z = d\ and a 2 x + b 2 y + c 2 z = d 2 . Find first a point P which is in the intersection. Then 
r(t) = P + t(ni x n 2 ) is the line, we were looking for. 



LINES IN THE PLANE. 

LINE P = Q + tv. Eliminating t gives a single equation. For example, (x, y) = (1, 2) + £(3, 4) is equivalent to 
x = 1 + 3t, y = 2 + At and 4x — 3y = —2. The general equation for a line in the plane is lax + by = d . 






Lecture 4 2/18/2004, CURVES 



Math21a 



HOMEWORK FOR WEDNESDAY: Section 10.5: 4,10,26,42 (can assume 4.1) 



PARAMETRIC PLANE CURVES. If x(t), y(t) are functions of one variable, denned on the parameter inter- 
val / = [a, 6], then f(t) = (f(t),g(t)) is a parametric curve in the plane. The functions x(t),y(t) are called 
coordinate functions. 



PARAMETRIC SPACE CURVES. If x(t), y(t), z(t) are functions, then f(t) = (x(t), y(t), z(t)) is a space curve. 
Always think of the parameter t as time. For every fixed t, we have a point (x(t),y(t), z(t)) in space. As t 
varies, we move along the curve. 



EXAMPLE 1. If x(t) = t, y(t) = t 2 + 1, we can write y(x) = x 2 + 1 and the curve is a graph. 
EXAMPLE 2. If x(t) = cos(t),y(t) = sin(t), then r(t) follows a circle. 
EXAMPLE 3. If x(t) = cos(t), y(t) = sin(t), z(t) = t, then f(t) describes a spiral. 

EXAMPLE 4. If x(t) = cos(2£), y(t) = sin(2£), z(t) = 2£, then we have the same curve as in example 3 but we 
traverse it faster. The parameterization changed. 

EXAMPLE 5. If x(t) = cos(—t),y(t) = sin(— £), z(t) = —t, then we have the same curve as in example 3 but 
we traverse it in the opposite direction. 

EXAMPLE 6. If P = (a, b, c) and Q = (u, v, w) are points in space, then f(t) = (a+t(u—a), b+t(v—b), c+t{w—c)) 
defined on t 6 [0, 1] is a line segment connecting P with Q. 



ELIMINATION: Sometimes it is possible to eliminate the parameter t and write the curve using equations 
(one equation in the plane or two equations in space). 

EXAMPLE: (circle) If x(t) = cos(t),y(t) = sin(t), then x{t) 2 + y{t) 2 = 1. 

EXAMPLE: (spiral) If x{t) = cos(t),y(t) = s'm(t),z(t) = t, then x = cos(z),y = s'm(z). The spiral is the 
intersection of two graphs x = cos(z) and y = s'm(z). 



CIRCLE 




(cos(£),3sin(t)) 



TRIFOLIUM 




— cos(3£)(cos(£), sin(t)) 



HEART 




(1 + cos(£))(cos(£),sin(£)) 



EPICYCLE 




LISSAJOUS 




(cos(3£),sin(5£)) 



SPRING (3D) 




SPIRAL 




e*/ 10 (cos(£),sin(£)) 



TORAL KNOT (3D) 




(cos(£) + cos(3£)/2,sin(£) + 
sin(3£)/2) 



(cos(£), sin(£), t) 



(cos(£)+cos(9£)/2,sin(£) + 
cos(9£)/2,sin(9£)/2) 



WHERE DO CURVES APPEAR? Objects like particles, celestial bodies, or 
quantities change in time. Their motion is described by curves. Examples 
are the motion of a star moving in a galaxy, or data changing in time like 
(DJIA(t),NASDAQ(t),SP500(t)) 




Strings or knots are closed curves in space. 

Molecules like RNA or proteins can be modeled as curves. 

Computer graphics: surfaces are represented by mesh of curves. 

Typography: fonts represented by Bezier curves. 

Space time A curve in space-time describes the motion of particles. 

Topology Examples: space filling curves, boundaries of surfaces or knots. 



DERIVATIVES. If r{t) = (x(t),y(t),z(t)) is a curve, then |^(t) = ( x '(t), y'{t), z'jt)) = jx,y,z) | is called the 
velocity. Its length |r*(£)| is called speed and v/\v\ is called direction of motion. The vector r"(t) is called 
the acceleration. The third derivative r"' is called the jerk. 



The velocity vector f(t) is tangent to the curve at r(t). 



EXAMPLE. If r{t) 
(-9cos(3£),-4sin(2t), 



= (cos(3t),sin(2f),2sin(£)), then f'(t) = 
-2sin(t)) and r"'(t) = (27sin(3£), 8cos(2£), - 



(-3 sin(3t) , 2 cos(2t) , 2 cos(t)) , 
-2cos(t)>. 



WHAT IS MOTION? The paradoxon of Zeno of Elea: "When looking at 
a body at a specific time, the body is fixed. Being fixed at each instant, there 
is no motion". While one might wonder today a bit about Zeno's thoughts, 
there were philosophers like Kant, Hume or Hegel, who thought seriously about 
Zeno's challenges. Physicists continue to ponder about the question: "what is 
time and space?" Today, the derivative or rate of change is defined as a limit 
(r(t + dt) — r(t))/dt, where dt approaches zero. If the limit exists, the velocity 
is defined. 




EXAMPLES OF VELOCITIES. 



EXAMPLES OF ACCELERATIONS. 



Person walking: 
Signals in nerves: 
Plane: 

Sound in air: 
Speed of bullet: 
Earth around the sun: 
Sun around galaxy center: 
Light in vacuum: 



1.5 m/s 
40 m/s 
70-900 m/s 
Machl=340 m/s 
1200-1500 m/s 
30'000 m/s 
200'000 m/s 
300'000'000 m/s 



Train: 
Car: 

Space shuttle: 

Combat plane (F16) (blackout): 
Ejection from F16: 
Free fall: 

Electron in vacuum tube: 



0.1-0.3 m/s 2 
3-8 m/s 2 
<3G = 30m/s 2 
9G=90 m/s 2 
14G=140 m/s 2 . 
1G = 9.81 m/s 2 
10 15 m/s 2 



DIFFERENTIATION RULES. 

The rules in one dimensions (/ + g)' = f + g' (of)' = cf, (fg)' = f'g + fg' (Leibniz), (/(#))' = f'(g)g' 
(chain rule) generalize for vector-valued functions: (v + w)' — v 1 + w' , (cv)' = cv' , (v ■ w)' = v 1 • w + v • w' 
(v x w)' = v 1 x w + v x w' (Leibniz), (v{f{i)))' = v'(f(t))f'(t) (chain rule). The Leibniz rule for the triple dot 
product [u, v,w] = u ■ (v x w) is d/dt[u, v, w] = [u', v, w] + [u, v', w] + [u, v, w'] (see homework). 



INTEGRATION. If f*{t) and r(0) is known, we can figure out f(t) 
by integration f(t) = f(0) + J Q * f'(s) ds. 

Assume we know the acceleration a(t) = r"(t) as well as initial ve- 
locity and position r*(0) and f(0). Then f(t) = r(0)+tr'(0) + R(t), 
where R(t) = JqV(s) ds and v(t) = J* a(s) ds. 




EXAMPLE. Shooting a ball. If r" (t) = (0,0,-10), r'(O) = 50 100 150 200 ^ 300 
(0, 1000, 2), r(0) = (0, 0, h), then f(t) = (0, lOOOt, h + 2t- 10t 2 /2). 



KNOTS 

The threefoil knot is the space curve 



O. Knill, Math 21a 



f(t) = ((2 + cos(3t/2)) cos(t), (2 + cos(3£/2)) sin(t), sin(3t/2)) 

1) Find an interval [a, 6] on which r(t) parameterizes a closed curve in space. A closed curve in 
space is called a knot. These objects are not only interesting in mathematics or physics. For 
example, DNA of bacteria and some proteins form knots (see the back of the page). 

2) Verify that the projection r(t) = (x(t), y(t)) of this curve onto the £?/-plane is in polar coor- 
dinates by f(t) = 2 + cos(3t/2) 3 6(t) = t. 

3) Sketch this curve. 



Indicate at the crossings, which part of the curve is above the other. 



4) Calculate the velocity vector f"(t) of r(t) at t = 0. If this vector is v = (a, 6, c), then (a, b) is 
its projection on the xy plane. Draw this vector (a, b) in the above ^-projection of the knot. 
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KEPLER'S LAWS 



Oliver Knill 



FROM THE 2 BODY TO THE KEPLER PROBLEM. Two bodies with po- 
sition vectors r\,r<i and mass m\,mi attract each other with the force F = 
—Gm\m2^k, where x = f\ — ri is the difference vector between the bodies. The 
length \F\ of the force is proportional to 1/r 2 . The vector x = T\ — r-2 of length 
r satisfies x" = r*{ — = m^G^ — f\)/\r2 — fi\ 3 — miG{r\ — r2 ) / 1 ^2 — n| 3 = 
— (mi + ni2)Gx I r z . The new problem describes a mass-point with position x 
and mass m = m\ + ttt-2 which is attracted to a fixed force centered at the 
origin. This problem 



-mGxj' 



angular momentum L - 

do not change in time. 



is called the Kepler problem. The 

mx x x' and the energy E = m\x'\ 2 /2 + mG/\x\ 




T HE 2. KEPLER LAW. The "area law" or Keplers second law 
is: I "The radius vector x passes the equal area in equal time." | 
Proof. Because x" is parallel to x, and L = mx' x x, we get 
L' = (use the product law). If follows that the vector x 
stays in the plane spanned by x and x' . We can now use co- 
ordinates x = (r cos(0), r sin(0), 0) to describe the point. With 
£' = (r'cos(0),r'sin(0),O) + (-r sin(0), r cos(0), 0)0'. The con- 
served quantity \L\ = m\x' x x\ — mr 2 6' can be interpreted as 
2m/', where fit) is the area swept over by the vector x in the 
interval [0,t]. We will use the formula 



x(t-i-s) 



x(t) 




\L\=r 2 



later. 



ELLIPSES. An ellipse with focal points S = (-ae,0),S = 
(ae, 0) with eccentricity e is defined as the set of points in the 
plane whose distances f and r to S and S satisfy f + r = 2a. 
From (2a - r) 2 = f 2 = r 2 sin 2 (0) + (2ae + rcos(0)) 2 , we obtain 



q(i-Q 

l+ecos(#) 



the polar form of the ellipse. One can replace 
a with the constant tq = (1 — e)a, the length of f at = 0. The 
polar form becomes then 



, _ r (l+e) 
l+ecos(6>) 



The semiaxes of the 



ellipse have length a and b = a\J\ + e 2 . The area is | = irab \ as 
we will see later in the course. 




THE 1. KEPLER LAW. The radius vector x{t) describes an ellipse. 



Proof. (We show that a particle on an ellipse satisfying the 2. law has the correct acceleration of the Kepler law). From 
the 2. law, we know 6' = L/(mr 2 ). The polar form allows us to find the time derivatives 



a(l-e 2 )sin(0) L 



Lesin(0) 



(1 + ecos(0)) 2 mr 2 ma(l — e 2 ) 
of r(t). (In the last step, we replaced r 2 in the denominator with the polar formula.) The second derivative is 
(!)• 



With the unit vector ft = x/r, one has x" = [x" -n)n. From x = fir, we get x' = n'r + nr', x" = fi"r + 2n'r' ' + nr" . 
Therefore | x" ■ n = ft" ■ nr + 2n' ■ fir' + r" | (2). 

From ft = (cos(0), sin(0), 0), we have n' = 6'n 1 - where n 1 - = (— sin(0), cos(^), 0). This gives ft' ■ n' = (9') 2 . 
Using n ■ ft = 1, we have | n ■ ft' = | (3). From ft" ■ ft + n' ■ n' = and n' ■ n' = (9') 2 = L 2 / (mr 2 ) 2 we obtain 

(4). Plugging (1),(3),(4) into (2) gives 



L 2 r 
(mr 2 ) 2 



L 2 (l+ecos(g)) 
m 2 r 2 a{l-e 2 ) 



x ■ n = r + n ■ nr - 



L 2 ecos{6) L 2 (l + ecos(0)) 



-L 2 



Therefore, 



m 2 a(l — e 2 )r 2 m 2 r 2 a(l — e 2 ) m 2 a(l — e 2 )r 2 
I? 



x" = (x" ■ n)n = (x" ■ ft) — = 



r m 2 a{l — e 2 ) r' K 



= -mG- 



THE 3. KEPLER LAW. Let T be the period of the orbit. It is the time the body needs to go around the 
ellipse once. (If S is the sun and r(t) the orbit of the earth, then T is one year.) The third Kepler law states 
that 



T 2 /a 3 = An 2 /(Gm) is a constant 



Proof. If f{t) is the area swept by the radial vector from time to time £, then fit) = L/(2m) implies 
that the area of the ellipse A = -na 2 \J\ — e 2 is equal to LT/{2m). From T = 2m r na 2 \J\ — e 2 /L, we obtain 
T 2 /a 3 = 4m 2 7r 2 a(l - e 2 )/L 2 = 47r 2 /(Gm). 



MEASURING THE GRAVITATIONAL CONSTANT. Note that the 3. Kepler law allows us to compute the 
gravitational constant G from the period, the total mass (essentially the mass of the sun) and the geometry of 
the ellipse. 

The case e > 1 corresponds to a negative G, where particles repell each other. The third Kepler law does then 
no more apply and ellipses become a hyperbola. The second law is unchanged. 



REMARKS. 

If the force is changed to x" = —mGx/r a , (note that a = 3 is the Kepler case), then the second Kepler law 
still applies, the other two laws not. The formula 9 = L/(mr 2 ) still applies. Also the derivation of the formula 
for x" ■ n = r" — L 2 /(m 2 r 3 ) is valid. The left hand side is -mG/r a_1 , which leads to the ordinary differential 
equation r" = -mG/r a_1 + L 2 /(m 2 r 3 ) for r(t). Knowing r(t) gives then 9(t) from 9 = L/(mr 2 ). The global 
behavior depends on the constants G, L. If a = 4, which corresponds to the two body problem in 4 dimensions, 
then r" = C/r 3 , where C = —mG-\-L 2 /m 2 is a constant. If C > 0, the bodies seperate to infinity. If C < 0, then 
r(t) — > 0. Only if the angular momentum is such that C = 0, there is a bounded motion. Stable planetary 
systems would not exist in four dimensions. A theorem of Bertant states that only for a = 3 (the Kepler 
case) and a = — 1 (the harmonic oscillator), all bounded orbits are periodic. 

COLLISIONS. 

If the two podies collide, we get a collision singularity. Collisions can occur in the 2-body problem, if the 
total angular momentum of the two bodies is zero. Analysing collision singularities involving more than two 
bodies helps to understand what happens when particles are close to such collision configurations. It is known 
that initial conditions leading to collisions are rare in the n-body problem. Noncollision singularities in which 
particles escape to infinity in finite time exist already for the 5-body problem. 

Our galaxy and M31, the Andromeda galaxy, form a relatively isolated system known as the local group. The 
center of mass of M31 approaches the center of mass our galaxy with a velocity of 119 km/s. In about 10 10 
years, these galaxies are likely to collide. Such a collision will have dramatic consequences for both systems. 
Nevertheless, even a direct encounter would probably not lead to any collision of any two stars. 



SOME HISTORY. 

• Nicolas Kopernicus (1473-1543) had a heliocentric system. 

• Galileo Galilei (1564-1642) discovers Jupiter moons. 

• Johannes Kepler (1571-1630) builds on the observations of Tycho 
Brahe. Finds the first and second Kepler Law in 1609, the third 
in 1619. 

• Before Newton, the dynamics of celestial objects was described em- 
pirically, first by circular, later by epicycle approximations which 
effectively where Fourier approximations of the actual elliptic mo- 
tion. 

• Issac Newton (1643-1727) led the foundations of mathematical 
description and developed calculus simultaneously with Leibnitz. 
Newton had already solved the Kepler problem geometrically. 

• Johan Bernoulli proved in 1710 that solutions to the 2 body prob- 
lem move on conic sections. 

• Laplace, Lagrange, Hamilton and Poisson belong to the ancestors 
of celestial mechanics, the study of the n-body problem. 

• With Poincare (1854-1912) at the end of the 19'th century, and 
Birkhoff (at Harvard!), the subject of the three body problem was 
studied with new geometric and topological methods. 




Lecture 5: ARC LENGTH 



Math21a 



HOMEWORK: 10.6: 8,14,20,22 



PLANE CURVE 

r(t) = (x(t),y(t)) position 
r'(t) = (x'(t), y'(t)) velocity 
\r'(t)\ = (x'(t), y'(t)) speed 

r "(t) = (x"(t),y"(t)) acceleration f '"(t) = 
(x"'(t),y"'(t)))erk 



SPACE CURVE 

r(t) = (x(t),y(t), z(t)) position 
f '(£) = (x'(t),y'(t),z'(t)) velocity 
\r'(t)\ = (x'(t), y'(t)) speed 

r"\t) = (x"(t),y"(t),z"(t)) acceleration r"'(t) - 
(x'"(t),y'"(t),z'"(t))ierk 



ARC LENGTH. If t G [a, b] h-> f(t) with velocity v(t) = r'(t) and speed \v(t)\, then f* \v(t)\ dt is called the arc 
length of the curve. For space curves this is 



PARAMETER INDEPENDENCE. The arc length is independent of the parameterization of the curve. 



REASON. Changing the parameter is a change of variables (substitution) in the integration. 

EXAMPLE. The circle parameterized by f(t) = (cos(£ 2 ), sin(£ 2 )) on t = [0, y/2ir] has the velocity r '(t) 



2t(- sin(£),cos(£)) and speed 2t. The arc length is f/^2t dt = t 2 \^ 2n = 2ir. 



REMARK. Often, there is no closed formula for the arc length of a curve. For example, the Lissajoux 
figure f(t) = (cos(3£), sin(5£)) has the length J Q 27r yj 9 sin 2 (3t) + 25 cos 2 (5£) dt. This integral must be evaluated 
numerically. If you do the Mathematica Lab, you will see how to do that with the computer. 



THE MATERIAL BELOW IS NOT PART OF THIS COURSE. 



CURVATURE. 

T(t) =r'(t)\/\r'(t)\ unit tangent vector 
n{t) = \f'{t)\/\r'{t)\ curvature 



CURVATURE FORMULA 
((a, b) x (c, d)=ad- be in 2D) 



k(+ \ _ \r'(t)xf "(t)\ 
- \r '(t)|3 



EXAMPLE. CIRCLE 

f{t) = (r cos(t), r sin(t)). 
fit) = (— r sin(t),r cos(t)). 
\f 'it)\ = r. 

f(t) = (-sin(t),cos(t)). 
f"{t) = (-rcos(t),-rsin(t)). 
f'{t) = (-cos(t),-sin(t)). 
K (t) = \r(t)\/\f(t)\ = l/r. 



EXAMPLE. HELIX 

f(t) = (cos(t),sin(t),t). 
f'(t) = (-sin(t),cos(t),l). 
\f(t)\ = (-sin(£),cos(£),l) = ^2. 
f(t) = (- sin(t), cos(t), 1)/V2. 
f"{t) = (-cos(t),-sin(t),0). 
f'{t) = {-cos{t),sm{t),G)/y/2. 
«(*) = \f'{t)\/\f >{t)\ = 1/2. 



INTERPRETATION. 

If s(t) = J* \f(t)\ dt, then s'(t) = ds/dt = \f(t)\. Because 

f'{t) = df/dt = df/ds ■ ds/dt, we have 

\df/ds\ = \ f'(t)\/\f(t)\ = K (t). 




"The curvature is the length of the acceleration vec- 
tor if f(t) traces the curve with constant speed 1." 



A large curvature at a point means that the curve is strongly 
bent. Unlike the acceleration or the velocity, the curvature does 
not depend on the parameterization of the curve. You "see" the 
curvature, while you "feel" the acceleration. 




Large 
curvature 

K = 

1/r = 2 



CURVATURE OF A GRAPH. 

The curve f(t) = (t,f(t)), which is the graph of a function / 

has the velocity f '(t) = (1, /'(£)) and the unit tangent vector \ - 5 

f(t) = (l,/ , (t))/ x /l + / , (t)2 2 and after some simplification 




n(t) = \T'{t)\/\f{t)\ = \r [t)\/ y/i + f(tr i 


2 \ 4 6/ 8 

\ / 


EXAMPLE. f{t) = sin(t), then n{t) = \ sin(t)|/| y/l + cos 2 (t) 3 . 

-i 



TANGENT/NORM AL/BINORMAL. 
T(t) = |^/^]| tangent vector 



Nit) 



= T'(t) 
\T'(t)\ 



unit normal vector 



Bit) = T{t) x N(t) binormal vector 

Because f{t)-f{t) = 1, we get after differentiation f'(t)-f(t) = 
and Nit) is perpendicular to Tit). 




The three vectors (T(t), Nit), Bit)) are unit vectors orthogonal to each other. 



Note. In order that (T \t), Nit), Bit)) exist, we need that fit) is not zero. 



WHERE IS CURVATURE NEEDED? 




OPTICS. If a curve r(£) represents a wavefront and n(£) is a unit 
vector normal to the curve at r(£), then s(£) = r(£) + n(t)/«;(£) 
defines a new curve called the caustic of the curve. Geometers 
call that curve the evolute of the original curve. 




HISTORY. 



Aristotle: (350 BC) distinguishes between straight lines, circles and "mixed behavior" 
Oresme: (14'th century): measure of twist called "curvitas" 
Kepler: (15'th century): circle of curvature. 

Huygens: (16'th century): evolutes and involutes in connection with optics. 

Newton: (17'th century): circle has constant curvature inversely proportional to radius, (using infinitesimals) 
Simpson: (17'th century): string construction of evolutes, description using fluxions. 
Euler: (17'th century): first formulas of curvature using second derivatives. 
Gauss: (18'th century): modern description, higher dimensional versions. 



COMPUTING CURVATURE WITH MATHEMATICA 



*[t_]:=Cos[3 t]; 
a[t_]':=Sin[5 th 
r[t_h={x[t],y[t]h 
dr[t_]:=D[r[s],s] / + s->t; 
L[{a_,b_}]:=Sqrt[a"2+b"2h 
T[t_]:=dr[t]/L[dr[t]]; 
dT[tL];=D[T[s],s] / t s-Hj 
kap P a[t_]:=L[dT[t]]/L[dr[t]] 




Lecture 7: 2/25/2004, FUNCTIONS AND LEVEL CURVES Math21a, O. Knill 



Section 11.1: 4(abc) 6(abc), 14,16,18,42, Section 11.3: 6 18 



FUNCTIONS, DOMAIN AND RANGE. We deal with functions f(x,y) of two variables denned on a domain 
D. The domain is usually the entire plane like for f(x,y) = x 2 + sm.{xy). But there are cases like in f(x,y) = 
l/\/l — {x 2 + y 2 ), where the domain is a subset of the plane. The range is the set of possible values of /. 



LEVEL CURVES 

2D: If f(x,y) is a function of two variables, then 
f{x,y) = c = const is a curve or a collection of 
curves in the plane. It is called contour curve or 
level curve. For example, f(x,y) = 4x 2 + 3y 2 = 1 
is an ellipse. Level curves allow to visualize functions 
of two variables f(x,y). 



LEVEL SURFACES. We will later see also 3D ana- 
logues: if f(x,y,z) is a function of three variables 
and c is a constant then f(x,y,z) = c is a surface 
in space. It is called a contour surface or a level 
surface. For example if f(x, y, z) = 4x 2 + 3y 2 + z 2 
then the contour surfaces are ellipsoids. 



EXAMPLE. Let f(x,y) = x 2 — y 2 . The set x 2 — y 2 = is the union of the sets x = y and x = —y. The set 
x 2 — y 2 = 1 consists of two hyperbola with with their tips at (—1,0) and (1,0). The set x 2 — y 2 = —1 consists 
of two hyperbola with their tips at (0, ±1). 



EXAMPLE. Let f(x,y,z) = x 2 + y 2 - z 2 . f(x,y,z) = 
0, f(x, y, z) = 1, f(x, y, z) = —1. The set x 2 -\-y 2 — z 2 = is a cone 
rotational symmetric around the z-axis. The set x 2 + y 2 — z 2 = 1 
is a one-sheeted hyperboloid, the set x 2 + y 2 — z 2 = — 1 is a 
two-sheeted hyperboloid. (To see that it is two-sheeted note 
that the intersection with z = c is empty for — 1 < z < 1.) 




CONTOUR MAP. Drawing several contour curves {f(x,y) = c] or sev- 
eral contour surfaces {f(x,y,z) = c} produces a contour map. 




The example shows the graph of the function f(x,y) = sm(xy). We draw the contour map of /: The curve 
sin(xy) = c is xy = C, where C = arcsin(c) is a constant. The curves y = C/x are hyperbolas except for C — 0, 
where y = is a line. Also the line x = is a contour curve. The contour map is a family of hyperbolas and 
the coordinate axis. 



TOPOGRAPHY. Topographical maps often show the curves of equal height. With the contour curves as 
information, it is usually already possible to get a good picture of the situation. 



EXAMPLE. f(x,y) = 1 - 2x 2 - 
y 2 . The contour curves f(x,y) = 
1 — 2x 2 + y 2 = c are the ellipses 
2x 2 + y 2 = 1 - c for c < 1. 
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SPECIAL LINES. Level curves are encountered every day: 



Isobars: 


pressure 


Isoclines: 


direction 



Isothermes: 


temperature 


Isoheight: 


height 



For example, the isobars to the right show the lines of constant temper- 
ature in the north east of the US. 




A SADDLE. f(x,y) = {x 2 - 
y 2 )e~ x ~ y . We can here no more 
find explicit formulas for the con- 
tour curves (x 2 — y 2 )e~ x ~ y = c. 
Lets try our best: 




• f(x, y) = means x 2 — y 2 = so that x = y,x = —y are contour curves. 

• On y = ax the function is g(x) = (1 — a 2 )x 2 e~^ 1+a2 ^ x ' 2 . 

• Because f(x, y) = f{—x, y) = f(x, —y), the function is symmetric with respect to reflections at the x and 
y axis. 



A SOMBRERO The su rface z = 
f(x,y) = sin(y^x 2 + y 2 ) has cir- 
cles as contour lines. 





ABOUT CONTINUITY. In reality, one sometimes has to deal with functions which are not smooth or not 
continuous: For example, when plotting the temperature of water in relation to pressure and volume, one 
experiences phase transitions, an other example are water waves breaking in the ocean. Mathematicians 
have also tried to explain "catastrophic" events mathematically with a theory called "catastrophe theory". 
Discontinuous things are useful (for example in switches), or not so useful (for example, if something breaks). 



DEFINITION. A function f(x,y) is continuous at (a, 6) if f(a,b) is finite and lim^^)^^^) f(x,y) = /(a, b). 
The later means that that along any curve f(t) with r(0) = (a, 6), we have hm^ f{r{t)) = /(a, b). 
Continuity for functions of more variables is defined in the same way. 



EXAMPLE. f(x,y) = {xy)/{x 2 + y 2 ). Because 
lim( a . )a; )_ > (o ) o) f{x, x) = lim^o x 2 / (2x 2 ) = 1/2 and 
lim( aj0 )-Ko,o)/(0,aO = lim (x,o)^(o,o) = 0. The 
function is not continuous. 



EXAMPLE. f(x,y) = {x 2 y)/{x 2 + y 2 ). In polar 
coordinates this is f{r,9) = r 3 cos 2 (#) sin(#)/r 2 = 
r cos 2 (6>) sin(0). We see that f{r,9) — >■ uniformly 
if r — ► 0. The function is continuous. 



Lecture 8: 2/26/2004, PARTIAL DERIVATIVES 



Math21a, O. Knill 



HOMEWORK. 11.3: 6 18 (if not done yet), 40,42,66,70 



PARTIAL DERIVATIVE. If f(x,y) is a function of two variables, then J^f(x,y,z) is denned as the deriva- 
tive of the function g(x) = f(x, y, z), where y is fixed. The partial derivative with respect to y is defined similarly. 

NOTATION. One also writes f x (x,y) = J^f(x,y) etc. For iterated derivatives the notation is similar: for 
example f xy = £^f. 

REMARK. The partial derivatives d x f, d y f measure the rate of change of the function in the x or y directions. 
For functions of more variables, the partial derivatives are defined in a similar way. 



EXAMPLE. f(x,y) = x 4 - 6x 2 y 2 + y 4 . We have f x (x,y) = 4x 3 - 
12xy 2 J xx = 12x 2 -12y 2 J y (x : y) = -12x 2 y + 4y 3 J yy = -12x 2 + 12y 2 . 
We see that f xx + f yy = 0. A function which satisfies this equation is 
called harmonic. The equation f xx + f yy = is an example of a 
partial differential equation. 




CLAIROT THEOREM. If f xy and f yx are both continuous, then f xy = f yx . Proof. Compare the two sides: 



dxf x (x, y) ~ f(x + dx, y) - f(x, y) 

dydxf xy (x,y) ~ f(x + dx 1 y + dy)-f(x + dx,y + dy)- 

(f(x + dx,y) - f(x,y)) 



dyf y {x, y) ~ f(x, y + dy) - f(x, y). 

dxdyf yx (x,y) ~ f(x + dx,y + dy)-f(x + dx,y) - 

(f(x,y + dy) - f(x,y)) 



CONTINUITY IS NECESSARY. Example: f(x, y) = (x 3 y - xy 3 )/(x 2 + y 2 ) contradicts Clairot: 



f x (x,y) = (Sx 2 y - y 3 )/{x 2 + y 2 ) - 2x{x 3 y ■ 
xy 3 )/(x 2 + y 2 ) 2 , f x (0, y) = -y, f xy (0, 0) = -1, 



f y {x,y) = {x 3 - 3xy 2 )/{x 2 + y 2 ) - 2y{x 3 y ■ 
xy 3 )/{x 2 + y 2 ) 2 , f y (x, 0) = x, f y>x (0, 0) = 1. 



\f(x,y) 



\fx(x,y) 



fxy{x,y) 







GRADIENT. If f(x,y,z) is a function of three variables, then 

vf(x, y, z) = (J^ffa y, z), -^f( x , y> z ), Yz^ x ' Vl z ^ 

is called the gradient of /. The symbol V is called Nabla. It is named after an Egyptian harp, the Hebrew 
word "nevel"=harp seems to have the same aramaic origin). We will talk about the gradient in detail next 
week. 



NORMAL. As we will see later, the gradient S7f(x,y) is orthog- 
onal to the level curve f(x, y) = c and the gradient V/(x, y, z) is 
normal to the level surface f{x 1 y 1 z). For example, the gradient 
of f(x, y, z) = x 2 + y 2 — z 2 at a point (x, y, z) is (2x, 2y, —2z). 



4 



WHY ARE PARTIAL DERIVATIVES IMPORTANT? 

• Geometry. For example, the gradient S7f(x,y,z) is a vector normal to a surface at the point (x,y,z). 
Tangent spaces. 

• Approximations, linarizations. 

• Partial differential equations: laws describing nature. 

• Optimization problems, as we will see later. 

• Solution to some integration problems using generalizations of fundamental theorem of calculus. 

• In general helpful to understand and analyze functions of several variables. 



PARTIAL DIFFERENTIAL EQUATIONS. An equation which involves partial derivatives of an unknown 
function is called a partial differential equation. If only the derivative with respect to one variable appears, 
it is called an ordinary differential equation. 

1) fxx(x,y) = f yy (x,y) is an example of a partial differential equation 1) f x (x,y) = f xx {x,y) is an example of 
an ordinary differential equation. The variable y can be considered as a parameter. 



LAPLACE EQUATION fiify 


^^^^^^ADVECTION EQUA- ^^^^^^^^^^ 


fxx ~\~ fyy — |. 




f(x,t) = x 2 -y 2 . \% 





WAVE EQUATION jt 




fu = fxx ■ f(t, x) = \A/xX7\ 


^^^^^^ HEAT EQUATION /^^^^^^^^ 


sin(x — t) + sm.(x + t) ^OSO^ 





"A great deal of my work is just playing with equations and 
seeing what they give. I don't suppose that applies so much to 
other physicists; I think it's a peculiarity of myself that I like to 
play about with equations, just looking for beautiful mathe- 
matical relations which maybe don't have any physical meaning 
at all. Sometimes they do." - Paul A. M. Dirac. 




Dirac discovered a PDE describing the electron which is consistent both with quantum theory and special relativity. 
This won him the Nobel Prize in 1933. Dirac's equation could have two solutions, one for an electron with positive 
energy, and one for an electron with negative energy. Dirac interpretated the later as an antiparticle: the existence of 
antiparticles was later confirmed. 



Lecture 9: 3/1/2004, CHAIN RULE 



Math21a, O. Knill 



HOMEWORK. Section 11.4: 8,28,36,38,46. 



REMEMBER? If / and g are functions of one variable £, then d/dtf(g(t)) = f'(g(t))g'(t). For example, 
d/dtsin(log(t)) = cos(log(t))/t. 

GRADIENT. Define Vf(x,y) = {f x (x, y), f y (x, y)). It is called the gradient of /. 

THE CHAIN RULE. If fit) is curve in space and / is a function of three variables, we get a function of one 
variables t ^ f(f(t)). 

The chain rule looks like the ID chain rule, but where the derivative /' is replaced with the gradient V/ = 



(/*, f y , f z ). d/dtf(f(t)) = vmt)) ■ 



WRITING IT OUT. Writing the dot product out gives 
d 



dt f{x{t),y(t)) = f x (x(t),y(t))x'(t) + f y (x(t),y(t))y'(t) . 



EXAMPLE. Let z = sin(x + 2y), where x and y are functions of t: x = e*, y = cos(t). What is ? 

Here, z = f(x,y) = sin(x + 2y), z x = cos(x + 2y), and z y = 2cos(x + 2y) and ^ = e t , ^ = — sin(t) and 
4f = cos(rr + 2y)e x — 2 cos{x + 2y) sin(t). 



EXAMPLE. If / is the temperature distribution in a room and f(t) 
is the path of the spider Shelob, then f{f(t)) is the temperature, 
Shelob experiences at time t. The rate of change depends on the 
velocity f{t) of the spider as well as the temperature gradient V/ 
and the angle between gradient and velocity. For example, if the 
spider moves perpendicular to the gradient, its velocity is tangent 
to a level curve and the rate of change is zero. 



EXAMPLE. A nicer spider called "Nabla" moves along a circle f(t) = (cos(t), sin(t)) on a table with 
temperature distribution T(x,y) = x 2 — y 3 . Find the rate of change of the temperature, "Nabla" experiences. 

SOLUTION. VT{x,y) = {2x,-3y 2 ), f{t) = (- sin(t), cos(t)) d/dtT(f(t)) = VT(f(t)) • f(t) = 
(2cos(t),-3sin(£) 2 ) • (- sin(t), cos(t)) = -2cos(£) sin(t) -3sin 2 (t) cos(t). 

APPLICATION ENERGY CONSERVATION. If H(x, y) is the energy of a system, the system moves satisfies 



the equations x'(t) = H y ,y'(t) = —H x . For example, if H(x : y) = y 2 /2 + V{x) is a sum of kinetic and potential 
energy, then x'{t) = y,y'(t) = V'{x) is equivalent to x"{t) = —V'{x). In the case of the Kepler problem, we 
had V(x) = Gm/\x\. | The energy H is conserved"] . Proof. The chain rule shows that d/dtH(x(t),y(t)) 
H x {x,y)x'{t) + H y (x,y)y'(t) = H x (x,y)H y (x,y) - H y (x,y)H x (x,y) = 0. 

APPLICATION: IMPLICIT DIFFERENTIATION. 

From f(x, y) = one can express y as a function of x. From d/df(x, y{x)) = V/ • (1, y'(x)) = f x + f y y' = we 



obtain y' = -f x /f y . 



EXAMPLE. f(x,y) = x 4 + xsin(xy) = defines y = g(x). If f(x,g(x)) = 0, then g x (x) = -f x /f y = 
— {4.x 2, + sm(xy) + xy cos(xy)) / (x 2 cos(xy)). 

APPLICATION: DIFFERENTIATION RULES (OSCAR NIGHT). One ring of the 
chain rules them all! 

f(x, y) = x + y, x = u(t),y = v(t),d/dt(x + y) = f x u f + f y v' = u' + v'. 
f(x, y) = xy, x = u(t), y = v(t), d/dt(xy) = f x u' + f y v' = vu' + uv' . 
f(x, y) = x/y, x = u(t),y = v(t),d/dt(x/y) = f x u' + f y v' = u'/y - v'u/v 2 . 




DIETERICI EQUATION. In thermodynamics the temperature T, the pressure p and the volume V 
of a gase are related. One refinement of the ideal gas law pV = RT is the Dieterici equation 
f(p,V,T) = p(V — b)e a / RVT — RT = 0. The constant b depends on the volume of the molecules and a 
depends on the interaction of the molecules. (A different variation of the ideal gas law is van der Waals law). 
Problem: compute Vt- 



If V = V(T,p), the chain rule says f v V T + f T = 0, so that 
R)/(pVe a / RVT -p(V- b)e a / RVT /{RV 2 T)). (This could be simplified to (R + a/TV)/(RT/(V - b) - a/V 2 )). 



-fr/fv = -(-MV ~ b)e a ' RVT /{RVT 2 ) - 



dXjJ 



MORE DERIVATIVES. (The last three derivatives will only appear later) 



/ : R - R 3 


/ : R 3 - R 


/ : R 2 - R 3 


/ : R 2 ^ R 2 


/ : R 3 -> R 3 


curve 


scalar function 


surface 


coordinate change 


gradient field 


/' velocity vector. 


/' gradient vector. 


/' tangent matrix 


/' Jacobean matrix. 


/' Hessian matrix. 



THE GENERAL CHAIN RULE, (for people who have seen some linear algebra). First of all, the 
d/dtf{f{t)) = Vf(r(t)) ■ ?(t) is true in any dimension. If / is vector valued, the same equation holds for each 
component d/dtfi(r(t)) = X7fi(r(t)) -r*(£). One can further assume that f depends on different variables. Then 
this formula holds for each variable Xi. Here is the general chain rule for the curious: If / : R n — > R m and 
g : R fc — >■ R n , we can compose fog, which is a map from R fc to R m . The chain rule expresses the derivative 
of / o g{x) = f(g(x)) in terms of the derivatives of / and g. 



or short 



\{fog)'{x) = f'{g{x))-g'{x)\ 



Both f'(g(x)) and g'(x) are matrices and • is the matrix multiplication. The chain rule in higher dimensions 
looks like the chain rule in one dimension, only that the objects are matrices and the multiplication is matrix 
multiplication. 



EXAMPLE. GRADIENT IN POLAR COORDINATES. In polar coordinates, the gradient is defined as V/ = 
(fr, fe/r). Using the chain rule, we can relate this to the usual gradient: d/drf(x(r, 9), y(r, 9)) = f x (x, y) cos(#)+ 
f y (x, y) sin(0) and dj \rd9) f (x(r, 9) , y(r, 9)) = —f x (x,y)sin(9) + f y (x, y) cos(#) means that the length of V/ is 
the same in both coordinate systems. 



PROOFS OF THE CHAIN RULE. 

1. Proof. Near any point, we can approximate / by a linear function L. It is enough to check the 
chain rule for linear functions f(x,y) = ax + by — c and if f(t) = (xo + tu, yo + tv) is a line. Then 
^/(f(t)) = ^(a(xo + tu) + b(yo +tv)) = au + bv and this is the dot product of V/ = (a, b) with f'(t) = (u, v). 

2. Proof. Plugging in the definitions of the derivatives and use limits. 



WHERE IS THE CHAIN RULE NEEDED? (informal). 

While the chain rule is useful in calculations using the composition of functions, the iteration of maps or in 
doing change of variables, it is also useful for understanding some theoretical aspects. Examples: 

• In the proof of the fact that gradients are orthogonal to level surfaces, (see Wednesday). 

• It appears in change of variable formulas. 

• It will be used in the fundamental theorem for line integrals coming up later in the course. 

• The chain rule illustrates also the Lagrange multiplier method which we will see later. 

• In fluid dynamics, PDE's often involve terms u t + uVu which give the change of the velocity in the 
frame of a fluid particle. 

• In chaos theory, where one wants to understand what happens after iterating a map /. 



Lecture 10: 3/4/2004, DIRECTIONAL DERIVATIVE 



Math21a 



HOMEWORK. 11.5: 22,26,42,56 



REVIEW. CHAIN RULE. The chain rule in multivariable calculus is 

where V/ = (f x , f y ) in two dimensions or V/ = (f xi f yi f z ) in three dimensions. 



DIRECTIONAL DERIVATIVE. 

If / is a function of several variables and v is a vector, then V/ • v is called the directional derivative of / 
in the direction v. One writes V#/ or D$f. 



D v f(x, y, z) = V/0, y,z)-v 



It is usually assumed that v is a unit vector but we do not insist on that. Using the chain rule, one can write 
iD,f=f t f(x + tv). 



EXAMPLE. PARTIAL DERIVATIVES ARE SPECIAL DIRECTIONAL DERIVATIVES. 

If v = (1, 0, 0), then D$f = V/ • v — f x . The directional derivative is a generalization of the 

If v — (0, 1, 0), then D$f = V/ • v — f y . partial derivatives. Like the partial derivatives, it is 

If v = (0, 0, 1), then Dtff = Vf-v = f z . a scalar. 



EXAMPLE. DIRECTIONAL DERIVATIVE ALONG A CURVE. 

If / is the temperature in a room and f(t) is a curve with velocity f'(t), then V/(r(t)) -f'it) is the temperature 
change, one measures on the point moving on a curve f(t) experiences: the chain rule told us that this is 
d/dtf(f(t))- 



GRADIENTS AND LEVEL CURVES/SURFACES. 



Gradients are 
orthogonal to 
level curves and level 
surfaces. 



Every vector x — xq in the 
tangent line satisfies V/ • 
(x — xq) = and is so 
orthogonal to V/. The 
same argument holds for 
surfaces (see Friday). 




STEEPEST DECENT. The directional derivative satisfies 

I At/| < \Vf\\v\ 

because V/ • v = |V/||v|| cos(0)| < |V/||v|. The direction v = V/ is the direction, where / increases most, 
the direction —V/ is the direction where / decreases most. It is the direction of steepest decent. 

IN WHICH DIRECTION DOES f INCREASE? Uv = V/, then the directional derivative is V/ • V/ = | V/| 2 . 
This means that / increases, if we move into the direction of the gradient! 



EXAMPLE. You are on a trip in a air-shop at (1, 2) and want to 
avoid a thunderstorm, a region of low pressure. The pressure is 
given by a function p(x, y) = x 2 + 2y 2 . In which direction do you 
have to fly so that the pressure change is largest? 

Parameterize the direction by v = (cos(0), sin ( </!>)). The pressure gradient is X7p(x, y) = (2x, 4y). The directional 
derivative in the 0-direction is Vp(x,y) ■ v = 2cos(0) + 4sin(0). This is maximal for — 2sin(0) + 4cos(0) = 
which means tan(0) = 1/2. 



ZERO DIRECTIONAL DERIVATIVE. The rate of change in all directions is zero if and only if Vf(x, y) = 0: 
if V/ 7^ 0, we can choose v = V/ and get -Dv// = IW| 2 . 



We will see later that points with V/ = are candidates for local maxima or minima of /. Points (x, y), where 
Vf(x,y) = (0,0) are called stationary points or critical points. Knowing the critical points is important to 
understand the function /. 



PROPERTIES DIRECTIONAL DERIVATIVE. 


PROPERTIES GRADIENT 


D v (Xf) = XD v (f) 


V(A/) = AV(/) 


D v {f + g) = D v {f)+D v {g) 


V(/ + s) = V(/) + V(fir) 


D v {fg) = D v (f)g + fD v (g) 


V(/<?) = V(f)9 + fV(g) 



THE MATTERHORN is a popular climbing mountain in the Swiss alps. Its 
height is 4478 meters (14,869 feet). It is quite easy to climb with a guide. There 
are ropes and ladders at difficult places. Even so, about 3 people die each year 
from climbing accidents at the Matterhorn, this does not stop you from trying 
an ascent. In suitable units on the ground, the height f(x, y) of the Matterhorn 
is approximated by f(x,y) = 4000 - x 2 - y 2 . At height /(-10, 10) = 3800, 
at the point (—10,10,3800), you rest. The climbing route continues into the 
north-east direction v = (1,-1). Calculate the rate of change in that direction. 
We have Vf(x, y) = (-2a;, -2y), so that (20, -20)-(l, -1) = 40. This is a place, 
with a ladder, where you climb 40 meters up when advancing lm forward. 

THE VAN DER WAALS (1837-1923) equation for real gases is 

{p + a/V 2 ){V-b) =RT{p,V) , 

where R = 8.314 J/ifmol is a constant called the Avogadro number. This 
law relates the pressure p, the volume V and the temperature T of a gas. The 
constant a is related to the molecular interactions, the constant b to the finite 
rest volume of the molecules. 

The ideal gas law pV = nRT is obtained when a, b are set to 
0. The level curves or isotherms T(p, V) = const tell much 
about the properties of the gas. The so called reduced van 
der Waals law T(p, V) = (p + 3/V 2 ){3V - l)/8 is obtained by 
scaling p, T, V depending on the gas. Calculate the directional 
derivative of T(p, V) at the point (p,V) = (1,1) into the direction 
v = (1,2). We have T p (p,V) = {W - l)/8 and T v (p,V) = 
3p/8- (9/8)l/y 2 - 3/(4y 3 ). Therefore, VT(1, 1) = (1/4,0) and 
D„r(l,l) = l/5. 



TANGENT LINE. Because n = Vf(xo,yo) = (a,b) is perpendicular to the level curve f(x,y) = c through 
(xo,yo), the equation for the tangent line is 

ax + by = d, a = f x (x , y ), b = f y (x , y ), d = ax + by 

EXAMPLE. The isotherme in the previous example through (1,1) has there the tangent (l/4)x + • y = 
(1/4)1 + 0-1 = 1/4 which is the horizontal line x — 1. 







3/5/2004, THE SKULL SURFACE, Oliver Knill 



, Math21a 




Surfaces are usually represented in a triangulated 
form, where a few points on the surface are given 
and triples of points form triangles. This illustrates 
the concept of linear approximation. Every 
triangle is close to the actual surface. 



HOW TO PRODUCE THE SKULL SURFACE. 

We found on the web a file 
skull.dat, which contains data 



-0.02 0.02 
0.29 0.65 0.08 
0.4 0.53 0.06 
-0.02 0.02 
0.4 0.53 0.06 
0.53 0.41 0.02 
-0.02 0.02 



These data encode points 
in space: every line is a 
point, three points in a 
row represent a triangle in 
space. 



So, the first 3 lines from 
the datafile represent the 
triangle {P,Q,P} = 
{(0,-0.02,0.02), 
(0.29,0.65,0.02), 
(0.4,0.65,0.08) }. In to- 
tal, the file contains 3729 
points which means that 
there are 1243 triangles 
encoded. 



PLOTTING THE SURFACE. 

To the right is a small Mathemat- 
ica program which produces a sur- 
face from these data. The data are 
first read, then packed into groups 
of 3 to get the points, then packed 
into groups of three to get the tri- 
angles, then each triple of points is 
made into a polygon. All these poly- 
gons are finally displayed. 



A=ReadList [" skull.dat" , Number] ; 
B=Partition[A,3]; 
P=Partition[B,3]; 
U=Map[Polygon,P]; 

Show [Graphics3D[U], Boxed -> False, ViewPoint -> {0, -3,0}] 



PLOTTING THE GRADIENTS. 

We can get the gradient vec- 
tor on each triangle {Pi,P 2 ,P 3 } 
by taking the cross product 
n = (Pi -P 2 ) x (P 3 -P 2 ) and adding 
lines connecting Q = (Pi + P 2 + P3 ) 
with the point Q + n. We used that 



Gradients are orthogonal 
to the level surfaces. 



A=ReadList [" skull.dat" , Number] ; 
B=Partition[A,3]; 
P=Partition[B,3]; 
U=Map [Polygon, P]; 

NormedCross[a_,b_]:=Module[{},u=Cross[a,b]; u/Sqrt[u.u]]; 
NormalVector[{ a_,b_,c_ }]:=Module[{},Pl=(a+b+c)/3; 
P2=P 1-NormedCross [b-a,c-a] /3 ;Line [{P 1 ,P2 }] ] ; 
V=Map[NormalVector,P] ; 

Show[Graphics3D[{U,V}], Boxed- > False, ViewPoint- > {0,-3,0}] 




Here is the output of the above routine. 
Fortunately, the points of each triangle 
were given in such a way that we also 
know the direction of the normal vector. 
We have chosen the direction pointing 
outwards. Through every triangle, one 
gradient vector is displayed. 



Once we have the data in Mathematica, we can also export a file, which a raytracer can read. We could also 
export to a VRML (Virtual Reality Markup Language) and fly around or through it. The ultimate Haloween 
experience! 



Lecture 11: 3/5/2004, TANGENT PLANES 



Math21a 



HOMEWORK. 11.5: 34,38,58,60 



REMINDER: TANGENT LINE. Because ft = V/(x , y ) = (a, b) is perpendicular to the level curve f(x, y) = c 
through (xo,yo), the equation for the tangent line is 



ax + by = d, a = f x (x 0: y ), b = f y (x ,y ), d = ax + l 



Example: Find the tangent to the graph of the function g(x) = x 2 
at the point (2, 4). Solution: the level curve f(x, y) = y — x 2 = 
is the graph of a function g(x) = x 2 and the tangent at a point 
(2,g(2)) = (2,4) is obtained by computing the gradient (a,b) = 
V/(2, 4) = (-g'{2), 1) = (-4, 1) and forming -Ax + y = d, where 
d = —4 -2 + 1-4 = —4. The answer is | — Ax + y = —4 | which is the 
line y = Ax — 4 of slope 4. Graphs of ID functions are curves in 
the plane, you have computed tangents in single variable calculus. 




GRADIENT IN 3D. If f(x,y,z) is a function of 
(f x (x,y,z),f y (x,y,z),f z (x,y,z)) is called the gradient of /. 



three variables, then Vf(x,y,z) = 



POTENTIAL AND FORCE. Force fields F in nature often are gradients of a function U(x, y, z) 
U is called a potential of F or the potential energy. 



The function 



EXAMPLE. If U(x,y,z) = l/\x\, then VU(x,y,z) = -x/\x\ 3 . The function U{x,y,z) is the Coulomb poten- 
tial and VU is the Coulomb force. The gravitational force has the same structure but a different constant. 
While much weaker, it is more effective because it only appears as an attractive force, while electric forces can 
be both attractive and repelling. 



LEVEL SURFACES. If f(x,y,z) is a func- 
tion of three variables, then f(x,y,z) = C 
is a surface called a level surface of /. The 
picture to the right shows the Barth surface 
(3 + 5t)(-l + x 2 + y 2 + z 2 ) 2 {-2 + t + x 2 + 
y 2 + z 2 ) 2 8(x 2 - t 4 y 2 )(-t 4 x 2 + z 2 ) (y 2 - t 4 z 2 ) 
(x 4 - 2x 2 y 2 + y 4 - 2x 2 z 2 - 2y 2 z 2 + z 4 ) = 0, 
where t = (y/E-l)/2 is the golden ratio. 




ORTHOGONALITY OF GRADIENT. We have seen 
that the gradient Vf(x,y) is normal to the level curve 
f(x,y) = c. This is also true in 3 dimensions: 



The gradient Vf(x,y,z) is nor- 
mal to the level surface f(x, y, z). 



The argument is the same as in 2 dimensions: take a curve 
r(t) on the level surface. Then ^f(r(t)) = 0. The chain 

rule tells from this that Vf(x, y, z) is perpendicular to the 
velocity vector f*(t). Having V/ tangent to all tangent 

velocity vectors on the surfaces forces it to be orthogonal. 




EXAMPLE. The gradient of f(x, y, z) = x 2 + 2y 2 + z 2 at 
a point (x, y, z) is (2x, 4y, 2z). It illustrates well that going 
into the direction of the gradient increases the value of the 
function. 




TANGENT PLANE. Because n = V/(x , y , z Q ) = {a, b, c) 
is perpendicular to the level surface f(x, y,z) = C through 
(xo, yo, zo), the equation for the tangent plane is 

ax + by + cz = d, (a, 6, c) = 
Vf(x , y , z ), d = ax + by + cz . 




EXAMPLE. Find the general formula for the tangent plane 
at a point (x,y,z) of the Barth surface. Just kidding ... 
Note however that computing this would be no big deal with 
the help of a computer algebra system like Mathematica. 
Lets look instead at the quartic surface 



f(x, y, z) = x 4 -x 3 + y 2 + z 2 



- 



which is also called the "piriform" or "pair shaped surface" . 
What is the equation for the tangent plane at the point P = 
(2, 2, 2)? We get (a, 6, c) = (20, 4, 4) and so the equation of 
the plane 2Qx + Ay + Az = 56. 




EXAMPLE. An important example of a level surface is 
g(x, y,z) = z — f(x, y) which is the graph of a function of 
two variables. The gradient of g is V/ = (—f x ,—f y ,l). 
This allows us to find the equation of the tangent plane at 
a point. 

Quizz: What is the relation between the gradient of / in 
the plane and the gradient of g in space? 




Lecture 12: 3/8/2004, LINEAR APPROXIMATION 



Math 21a, O. Knill 



HOMEWORK: 11.6: 4,12,18,40 (for Friday) 



LINE AR APPROXIMATION. 

| ID: | The linear approximation of a function f(x) at a point 
xq is the linear function L{x) = f(xo) + f'{xo){x — xq) . The 



graph of L is tangent to the graph of /. 

| 2D: | The linear approximation of a function f(x, y) at (xq, yo) 



L{x, y) = f{x , y ) + f x (x , yo)(x - x ) + f y (y - y ) 



The level curve of g is tangent to the level curve of / at (xo,yo). 
The graph of L is tangent to the graph of /. 



1 3D: | The linear approximation of a function f(x,y,z) at 
(xo,yo,z ) by 



L(x : y : z) = f(x ,y ,zo) + f x (x , y , z )(x - x ) + 
fy(x ,yo,z )(y - y ) + f z (x , y , z )(z - z ) 



The level surface of L is tangent to the level surface of / at 
(x ,yo,z ). 

In vector form, the linearization can be written as 



L(x) = f(x ) + Vf(x ) ■ (X - X ) 



JUSTIFYING THE LINEAR APPROXIMATION. 3 ways to see it: 

1) We know the tangent plane to g{x 1 y 1 z) = z — f{x,y) at (xo,yo,z + = f(xo,zo) is — f x x — f y y + z 
—fxXo — f y yo + zq. This can be read as z = zq + +f x (x — xo) + f y (y — yo)- Calling the right hand side L(x, y) 
shows that the graph of L is tangent to the graph of / at (xq, yo). 



2) The higher dimensional case can be reduced to the one dimensional case: if y = yo is fixed and x, then 
f(x,yo) = f(xo,yo) + f x (xo,yo){x — xo) is the linear approximation of the function. Similarly, if x = xo is 
fixed y is the single variable, then f(xo,y) = f(xo,yo) + fy(xo,Uo)(y — yo)- So along two directions, the linear 
approximations are the best. Together we get the approximation for f(x,y). 



3) An other justification is that V/(xo) is orthogonal to the level curve at xq- Because ft = V/(xo) is orthogonal 
to the plane n-(x-xo) = d also, the graphs of L(x, y) and f(x, y) have the same normal vector at {xo,yo, /(xq, yo). 



EXAMPLE (2D) Find the linear approximation of the func- 
tion f(x,y) = s'm(7Txy 2 ) at the point (1,1). The gradient is 
X7f(x,y) = (iry 2 cos(7Txy 2 ),2y7TCOs(7Txy 2 )). At the point (1,1), we 
have V/(l,l) = (it cos(7r), 2tt cos(7r)) = (— 7r, 2tt). The linear func- 
tion approximating / is L(x, y) = /(l, 1) + V/(l, 1) • (x — 1, y — 1) 
= — tt(x — 1) — 2ir(y — 1) = — irx — 2iry + 3tt. The level curves 
of G are the lines x + 2y = const. The line which passes through 
(1,1) satisfies x + 2y = 3. 

Application: -0.00943407 = /(1+0.01, 1+0.01) - #(1+0.01, 1+ 
0.01) = -ttO.01 - 2tt0.01 + 3tt = -0.00942478. 



EXAMPLE (3D) Find the linear approximation to f(x, y, z) = xy + yz + zx at the point (1, 1, 1). 




We have /(1,1,1) = 3, Vf{x,y,z) = (y + z, x + z, y + x), V/(l, 1, 1) = (2,2,2). Therefore L(x, y, z) = 
/(l, 1, 1) + (2, 2, 2) • {x - 1, y - 1, z - 1) = 3 + 2(x - 1) + 2(y - 1) + 2(z - 1) = 2x + 2y + 2z - 3. 



EXAMPLE (3D). Use the best linear approximation to f(x,y,z) = e x - s /yz to estimate the value of / at the 
point (0.01,24.8,1.02). 

Solution. Take {xo,yo,zo) = (0,25,1), where f{xo,yo,zo) = 5. The gradient is \7f(x,y,z) = 
{e x ^z,e x z/{2^) 1 e x ^/y). At the point (xo, yo, zo) = (0, 25, 1) the gradient is the vector (5,1/10,5). The linear 
approximation is L(x, y, z) = f(x , yo, zo) + V/(x , yo, z )(x - x Q , y - yo, z - z ) = 5 + (5, 1/10, 5)(x -0,y- 
25, z-l) = bx + y/10 + 5z- 2.5. We can approximate /(0.01, 24.8, 1.02) by 5 + (5, 1/10, 5) • (0.01, -0.2, 0.02) = 
5 + 0.05 - 0.02 + 0.10 = 5.13. The actual value is /(0.01, 24.8, 1.02) = 5.1306, very close to the estimate. 



fxx fx 



SECOND DERIVATIVE. 

If f(x,y) is a function of two variables, then the matrix f"(x,y) = I J J CX J j cy I is called the second 

|_ Jyx Ji 

derivative or the Hessian of /. 

For functions of three variables, the Hessian is the 3x3 matrix f"(x,y,z) = 



fxx fxy fxz 

fyx fyy fyz Because for 

fzx fzy fzz 

smooth functions, f xy = f yx , fyz = fzy, the matrix f" is symmetric (a reflection at the diagonal leaves it 
invariant). 



QUADRATIC APPROXIMATION, (informal) 

If F is a function of several variables x and xq is a point, then 



| Q(x) = /(go) + Vf(x )(x - xo) + [f"(x )(x - xo)} ■ (x - x )/2 | 
is called the quadratic approximation of x. 

It generalizes the quadratic approximation L(x) = f(xo) + f'(xo)(x — xo) + f"(xo)(x — xq) 2 /2 of a function of 
one variables. 



EXAMPLE. If the height of a hilly region is given by f{x,y) = 4000 — sin(x 2 + y 2 ), find the quadratic 
approximation of F at (0,0). 

Vf(x,y) = (2x,2y)cos(x 2 + y 2 ) so that V/(0,0) = (0,0). The linear approximation of F at (0,0) is 
G(x, y) = /(0, 0) = 4000. The graph of G is a plane tangent to the graph of F. 

f"(x,y) = cos(x 2 + y 2 ) | J J 

so that 

no,o) = [J ; 

The quadratic approximation at (0, 0) is Q(x, y) = 4000 — x 2 — y 2 . The graph of F is an inverted paraboloid. 



EXAMPLE: REDUCED VAN DER WAALS LAW T(p, V) = (p + 3/V 2 )(3V - l)/8 Find the quadratic approx- 
imation of T at (p,V) = (1,1). We had T p (p,V) = (3V-l)/8 and T v (p,V) = 3p/8 - (9/16)l/y 2 - 3/(4y 3 ) 
and VT(1,1) = (1/4,0). Now, T pp {p,V) = 0, T pV = 3/8, T Vp = 3/8 and T vv = (9/8)l/y 3 + (9/4)l/y 4 so 

that T"(l, 1) = | | . The quadratic approximation at T = (1, 1) is Q(p, T) = T(l, 1) + T'(l, 1) • (p - 

1,T- 1) + [T"(l,l)(p- 1,T- 1)] • (p- \,T- l)/2. 



ERROR OF APPROXIMATION. It follows from Taylors theorem that the error f(x,y) - L(x,y) in a region 
R near (xo, yo) is smaller or equal to M(\x — xo\ + \y — yo\) 2 /2, where M is the maximal value of all the matrix 
entries f"(x,y) in that region R. 



TOTAL DIFFERENTIAL. Aiming to estimate the change A/ = f(x,y) - f{x ,yo) of / for points (x,y) = 
(xo,yo) + {Ax, Ay) near (xo,yo), we can estimate it with the linear approximation which is L(Ax,Ay) = 
f x (xo,yo)Ax + f y Ay. In an old-fashioned notation, one writes also df = f x dx + f y dy and calls df the total 
differential. One can totally avoid the notation of the total differential. 



RELATIVE AND ABSOLUTE CHANGE. The relative change A/ depends on the magnitude of /. One also 
defines Af(xo,yo)/f(xo,Vo), the relative change of /. A good estimate is AL(xo,yo)/L(xo,yo). 



PHYSICAL LAWS. 



Many physical laws are in fact linear approximations to more complicated laws. One could say that a large 
fraction of physics consists of understand nature with linear laws. 

LINEAR STABILITY ANALYSIS. 

In physics, complicated situations can occur. Usually, many unknown parameters are present and the only 
way to analyze the situation theoretically is to assume that things depend linearly on these parameters. The 
analysis of the linear situation allows then to predict for example the stability of the system with respect to 
perturbations. Sometimes, the stability of the linearized system will imply the stability of the perturbation. 

ERROR ANALYSIS. 

Error analysis is based on linear approximation. Assume, you make a measurement of a function f(a,b,c), 
where a, b, c are parameters. Assume, you know the numbers a, b, c up to accuracy e. How precise do you know 
the values f(a, b, c)? Because f(ao + e , 60 + Q>, Co + e c ) is about f(ao, bo, Co) + V/(ao, bo, Co) • (e a , €b, e c ), the 
answer is that we know F up to accuracy | V f{ao, bo, co)|e. 

POWER LAWS. 

Some laws in physics are given by functions of the form g{x,y) = x a y& '. An example is the Cobb-Douglas 
formula in economics. Such dependence on x or y is called power law behavior. If we consider / = log(g), 
and introduce a = \og(x),b = \og(y), then this becomes f(a,b) = \og(g(x,y)) = aa + b/3. Power laws become 
linear laws in a logarithmic scale. But they usually are linear approximations to more complicated nonlinear 
relations. 



ELECTRONICS. 

If we apply a voltage difference U at the ends of a resistor R, then a current / flows. The relation U = RI 
is called Ohms law. In logarithmic coordinates log(?7) = log(R) + log(J), this is a linear law. In reality, the 
relation between current, voltage and resistance is more complicated. For example, if the resistor heats up, 
then its characteristics begin to change. Nonlinear resistors are used for example in synthesizers or in radars. 
While Ohm's law works extremely well, the nonlinear behavior can have important consequences for example 
to stabilize systems or to protect equipment against over- voltages. 

THERMODYNAMICS. 

If l(T) is the length of an object with temperature T, then l(T) = I (To) + c(T — To), where the expansion 
coefficient c depends on the material. (Trick question: What happens if you heat a ring, does the inner ring 
become smaller or bigger?). The volume of a hot air balloon and therefore its lift capacity grows like c(T — T ) 3 . 
The law of expansion is only an approximation. 

OCEANOGRAPHY. 

For oceanographers, it is important to know the water density p(T, S) in dependence on the temperature T 
(Kelvin) and the salinity S (psu). If we would include the pressure P (Bar), then we had a function p(T, S, P) 
of three variables. Near a specific point (T, S, P) the density can be approximated by a linear function giving 
a law which is precise enough. 

ENGENEERING. 

Hooks law tells that the force of a spring is proportional to the length with which it is pulled: F(l) = c(l — lo), 
where lo is the length when the spring is relaxed. This allows to measure weights or to cushion shocks. 
However, this law is only good in a certain range. If the spring is pulled too strongly, then more force is needed. 
Such a nonlinear behavior is needed for example in shock absorbers. 



MECHANICS 

For small amplitudes, the pendulum motion x = — gs'm(x) can be approximated by 'x = — gx, the harmonic 
oscillator. Nonlinear (partial) differential equations like u xx + u yy + u zz = F(x, y, z) are often approximated 
by linear differential equations. 

CARTOGRAPHY. 

It was well known already to the Greeks that we live on a sphere. On a sphere a triangle however the sum 
of its angles adds up to more than 180 degrees and every straight line (great circles) crosses every other line 
at least twice. Despite this, a city map can perfectly assume that the coordinate system is Cartesian. When 
drawing a plan of a house, an architect can assume that the house stands on a plane (the level curve of the 
linearization G(x,y) of F(x,y) defining the surface of the earth.) 

RELATIVITY 

Newton's law tells that r"(t), the acceleration of a particle is proportional to the force F which acts on the 
mass point: r"(t) = F/m. For a constant force and zero initial velocity this implies r'(t) = tF/m. This law can 
not apply for all times, because we can not reach the speed of light with a massive body. In special relativity, 
the Newton axiom is replaced with d / dt(r' (t)m(t)) = F, where the mass m(t) depends on the velocity. This 
gives v(t) = (tF/mo) , 2 \ 2 2 • Linearization at t = produces the classical law v(t) = tF/rriQ. 

ECONOMICS. 

The mathematician Charles W. Cobb and the economist Paul H. Douglas found in 1928 empirically a formula 
F(L, K) = bL a K? giving the total production F of an economic system as a function of the amount of labor L 
and the capital investment K. This is a linear law in logarithmic coordinates. The formula actually had been 
found by linear fit of empirical data. In general, the production depends in a more complicated way on labor and 
capital investment. For example, with increase of labor and investment, logistic constraints will become relevant. 

CHEMISTRY. 

The ideal gas law PV = RT relates the pressure, the volume and the temperature of an ideal gas using a 
constant R called the Avogadro number. This law T = f(P, V) is linear in logarithmic scales. This law is only 
an approximation and has to be replaced by the van der Waals law, which takes into account the molecular 
interactions as well as the volume of the molecules. 



FIRST MIDTERM REVIEW, 3/10/2004 



Math 21a 



GENERAL TIPS. 



• Ask questions: 



• Review the online quizzes. 

• Make list of facts on a sheet of paper. 

• Fresh up short-term memory before test. 

• Review homework. Find error patterns. 

• During the exam: read the questions carefully. Wrong understanding could lead you to solve an other problem: 



"Ask a question and you're a fool for 
three minutes; do not ask a question 
and you're a fool for the rest of your 
life." - Chinese Proverb 



There was a college student trying to earn some pocket money by going from house to house offering to 
do odd jobs. He explained this to a man who answered one door. "How much will you charge to paint 
my porch?" asked the man. "Forty dollars." "Fine" said the man, and gave the student the paint and 
brushes. Three hours later the paint-splattered lad knocked on the door again. "All done!", he says, and 
collects his money. "By the way," the student says, "That's not a Porsche, it's a Ferrari." 



MIDTERM TOPICS. 

• Properties of dot, cross and triple product 

• Orthogonality, parallel, vector projection 

• Parametrized Lines and Planes 

• Given line and plane, find intersection 

• Given plane and plane, find intersection 

• Given line and point, find plane 

• Given two points, find line 

• Given three points, find plane 

• Distances: point-line, line-line,point-plane 

• Distinguish and analyse curves 

• Determine curves from acceleration 

• Know Keplers laws, polar form of ellipse 

• Recognize functions f(x,y) of two variables. 

• Tangent lines, tangent curves 



• Distance between two lines 

• Distance between two planes 

• Angle between two vectors 

• Angle between two planes 

• Area of parallelogram, triangle in space 

• Volume of parallelepiped 

• Distinguish contour maps, graphs 

• Compute velocity, acceleration, speed 

• Integrate from velocity to get position 

• Find length of curves 

• Level curves, level surfaces 

• Directional derivative 

• Chain rule 

• Implicit differentiation 

• Tangent planes 



VECTORS. 

Two points P = (1,2,3),Q = (3,4,6) define a vec- 
tor v = PQ = (2,2,3). If v = Xw, then the vec- 
tors are parallel if v • w = 0, then the vectors are 
called orthogonal. For example, (1,2,3) is par- 
allel to (-2,-4,-6) and orthogonal to (3,-2,1). 
The addition, subtraction and scalar multiplication 
of vectors is done componentwise. For example: 
(3,2,1) -2((1, 1,1) + (-1,-1,0)) = (3,2,-1). 



A nonzero vector v and a point P = (xo,yo,zo) de- 
fine a line f(t) — P + tv. Two nonzero, non-parallel 
vectors v, w and a point P define a plane P + tv + ss. 
The vector n = v x w = (a, 6, c) is orthogonal to 
the plane. Points on the line satisfy the symmet- 
ric equation = = ^a. Points on the 
plane satisfy an equation ax + by + cz = d, where 
d = axo + byo + czq. Using the dot product for pro- 
jection and the vector product to get orthogonal vec- 
tors, one can solve many geometric problems in 3D. 



DOT PRODUCT (is scalar) 
v ■ w = w ■ V 
\v • w\ = \v\\w\ cos(q;) 
(av) ■ w = a(v ■ w) 
(u + v) -w == u • w + v ■ w 
{1,2, 3}. {3, 4,5} 
i(v-w) = (f t v)-w + (v-^ 



CROSS PRODUCT (is vector) 

commutative v x w = —w x v anti-commutative 

angle \v x w\ — \v\\w\sm(a) angle 

linearity (av) x w = a(v x w) linearity 

distributivity (u + v) xw=uxw+vxw distributivity 

in Mathematica Cross[{l, 2, 3}, {3, 4, 5}] Mathematica 

product rule ^(v x w) = (jj^v) x w + v x (jj^w) product rule 



PROJECTIONS. 
Vector projection: 



proj^(w;) = 



Is a vector parallel to w. 



Scalar projection: 



comp-(w) = |proj#(iu)| = 



the length of the projected 
vector. 



Applications: 

• Distance P+tv, Q+sw is scalar 
projection of PQ onto v x w. 

• Distance P, Q+tv+sw is scalar 
projection of PQ onto n = vxw. 



SURFACES {f(x,y,z) = c}. 

Examples are graphs, where f(x,y,z) = z — g(x,y) = or planes, where f(x,y,z) = ax + by + cz = c. 
Surfaces can be analyzed by looking at intersections with planes parallel to the coordinate planes. For graphs, 
the traces f(x,y) = c are contour lines. Most important fact: 



The gradient Vf(xo, yo, zq) is normal to the surface f(x, y,z) = c containing (xq, yo, zq). 



SURFACES EXAMPLES 



• sphere x 2 + y 2 + z 2 = 1 

• cylinder x 2 + y 2 = 1 

• ellipsoid x 2 /a 2 + y 2 /b 2 + z 2 /c 2 = 1 

• cone x 2 + y 2 — z 2 = 

• plane ax + by + cz = d 



• one sheeted hyperboloid x 2 + y 2 — z 2 = 

• two sheeted hyperboloid x 2 + y 2 — z 2 = 

• paraboloid x 2 + y 2 — z = 

• hyperbolic paraboloid x 2 — y 2 — z = 

• graph of function g(x, y) — z — 



can be identified using traces, the intersections with planes. 



CURVES. 

f(t) = (x(t),y(t), z(t)), t E [a,b] defines a curve. By differentiation, we obtain the velocity f '(t) and 
acceleration f "(t). If we integrate the speed \f '(t)\ over the interval [a,b], we obtain the length of the 
curve. 



f a ^/x'(t) 2 + y'(t) 2 + z'(t) 2 dt 



Example: f(t) = (l,3t 2 ,t 3 ), i*(t) = (0,6£,3£ 2 ), so that |r*(£)| = 3£(4 + £ 2 ). The length of the curve between 
and 1 is J* 3t(4 + t 2 ) dt = 6t 2 + 3^|J = 6 • |. 

DIRECTIONAL DERIVATIVE. 

For any vector v and a function f(x, y), define D$f(x, y) = Vf(x, y) ■ v. Unlike in many textbooks: 



We do not divide by \v\ to compute the directional derivative. 



CHAIN RULE. 

We have seen that d/dtf(f(t)) = Vf(f(t)) ■ f*(£). This works also, if f(t, s) is a function of two variables: 
f t (x(t,s),y(t,s)) = Vf(r(t,s))-r t (t,s). 
f s (x(t, s),y(t, s)) = V/(f(£, s)) ■ r s (t, s). 

Other variables: w(u,v) function of u,v, where u, v are functions of x and eventually of y: 



dw dw du _|_ dw dv 

Ox ~ du dx ^ dv dx 



dw dw du _|_ dw dv 

dx ~ du dx ^ dv dx 



For any three vectors a, b and c, we always have 

(a x 6) • c = — (a x c) ■ b 



The set of points in the xy-pl&ne which satisfy x 2 — y 2 = — 1 form a hyperbola. 



The velocity vector f'(t) and the acceleration vector r"(t) of a curve are always perpendicular. 



The length of a curve does not depend on the parameterization chosen for the curve. 



The directional derivative of f(x, y, z) = x 2 + y 2 — z 2 into the direction (1, 1, 1) at the point (0, 0, 1) is 
-2/VS. 



If \v x w\ = for all vectors w, then v = 0. 



Every vector contained in the line f(t) = (1 + 2t, 1 + 3£, 1 + 4t) is parallel to the vector (1,1,1) 



If the velocity vector r '(t) of the planar curve f(t) is orthogonal to the vector f(t) for all times £, then 
the curve is a circle. 



The vectors i + j and k are orthogonal. 



If the velocity vector of the curve f(t) is never zero and always parallel to a constant vector v for all times 
t, then the curve is a straight line. 



If u + v and u — v are orthogonal, then the vectors u and v have the same length. 



The identity \v ■ w\ 2 + \v x w\ 2 = \v\ 2 \w\ 2 holds for all vectors v,w. 



The set of points which have distance 1 from a line form a cylinder. 



The distance between a point P and a line L : f(t) = Q + tv is \v x PQ\/\PQ\. 



If w(u, v) is a function of two variables u, v and u, v are functions of x, y, then w x = w u u x + w y v x . 



The gradient Vf(x, y, z) is always tangent to the surface f(x, y, z) = c. 



The direction of steepest decent at a point (x,y) is Vf(x,y). 



The vector projection of a vector v onto a vector w is ^^-v. 



In the Kepler problem, one has d/dt(f ■ r 1 ) = 0. 



For any two vectors v,w one has \\v + w\\ 2 = \ \v\\ 2 + \ \w\\ 2 . 



For any two vectors v and w one has v x w = w x v. 



ix J — k = 



If your thumb u, your pointing finger v and middle finger w are perpendicular to each other, and u x v 
points in the direction opposite to w, then the three fingers belong to the left hand. 



If f(t) = Q + tv is a line and P is a point such that PQ is perpendicular to v, then the distance from P 
to the line is equal to the distance from P to the line. 



The distance of the two planes x + y + z = 3 and x + y + z = 5 is 2/y/S. 



A level curve of a function f(x, y) is a curve which has no self intersections. 



The angle between two planes is equal to the angle between its normal vectors. 



The speed of a curve f(t) is a vector tangent to the curve. 



If the acceleration vector of a curve f(t) is parallel to f(t) for all times, then the curve must lie in a plane. 



The arc length of a curve r(£) parameterized on the interval [a, b] is J^(x'(t) 2 + y'(t) 2 + z'(t) 2 ) dt. 



Lecture 14: 3/12/2004, CRITICAL POINTS 



Math21a, O. Knill 



HOMEWORK: Section 11.7: 8,18,20,22 



CRITICAL POINTS. A point (x ,y ) in a region G is called a critical point of f(x,y) if Vf{x ,yo) = (0,0). 
Remarks. Critical points are also called stationary points. Critical points are candidates for extrema because 
at critical points, the directional derivative is zero. It is usually assumed that / is differentiable. 

EXAMPLE 1. f(x,y) = x 4 + y 4 -4xy + 2. The gradient is Vf(x,y) = {4{x 3 - y), 4(y 3 - x)) with critical points 
(0,0), (1,1), (-1,-1). 

EXAMPLE 2. f(x,y) = sin(x 2 + y) + y. The gradient is Vf(x,y) = (2xcos(x 2 + y),cos(x 2 + y) + 1). For 
a critical points, we must have x = and cos(y) + 1=0 which means tt + k2ir. The critical points are at 
(0,tt), (0,3tt) 

EXAMPLE 3. ("volcano") f(x,y) = (x 2 + y 2 )e~ x2 - y2 . The gradient VF = (2x - 2x(x 2 + y 2 ),2y - 2y(x 2 + 
y 2 ))e~ x ~ y vanishes at (0,0) and on the circle x 2 + y 2 = 1. There are oo many critical points. 
EXAMPLE 4 ("pendulum") f(x,y) = —gcos(x) + y 2 /2 is the energy of the pendulum. The gradient VF = 
(y, —gsin(x)) is (0, 0) for x = 0, tt, 2tt, y = 0. These points are equilibrium points, where the pendulum is at 
rest. 

EXAMPLE 5 ("Volterra Lodka") f(x,y) = a\og(y) — by + clog(x) — dx. (This function is left invariant by the 
flow of the Volterra Lodka differential equation x = ax — bxy, y = —cy + dxy which you might have seen in 
Mathlb.) The point (c/d,a/b) is a critical point. 



TYPICAL EXAMPLES. 










% 














f(x,y) = x 2 + y 2 


f(x,y) = -x 2 - y 2 


f(x,y) = x 2 -y 2 



EXAMPLES WITH DISCRIMINANT D = det{H) = 0. 






f(x,y) = x 2 



f(x,y) -- 



f(x,y) = x 4 + y A 



CLASSIFICATION OF CRITICAL POINTS IN 1 DIMENSION. 

f'(x) = 0, f"(x) > 0, local minimum, f"(x) < local maximum, /" = undetermined. 







\ : 




\ 







CLASSIFICATION OF CRITICAL POINTS: SECOND DERIVATIVE TEST. Let f{x,y) be a function of 
two variables with a critical point (xo,yo). Define D = f xx fyy — f xy , called the discriminant or Hessian. 

fxx fxj 
fyx fyj 



(Remark: With the Hessian matrix H - 



we can write D = det(H) as a determinant. 



If D > and Hn > => local minimum (bottom of valley) 
If D > and Hn < => local maximum (top of mountain). 
If D < saddle point (mountain ] 



In the case D = 0, we would need 
higher derivatives to determine the na- 
ture of the the critical point. 



EXAMPLE. (A "napkin"). 

The function f(x, y) = x 3 /3 — x — {y 3 /3 — y) has the gradient V/(x, y) = (x 2 — 1, — y 2 + 1). It is the zero vector 

r 2x o i 

at the 4 critical points (1, 1),(— 1, 1),(1, — 1) and (— 1, — 1). The Hessian matrix is H = f"{x, y) = o r 



H{l,l) = 


"2 
-2 




H(-l,l) = 


" -2 
-2 




#(!,-!) = 


"20" 
2 




#(-1,-1) = 


"-20" 
2 




D = -4 
Saddle point 


D = 4,f xx = -2 
Local maximum 


D = 4,f xx = 2 
Local minimum 


D = -4 
Saddle point 




GLOBAL MAXIMA AND MIN- 
IMA. To determine the maximum 
or minimum of f(x, y) on a domain, 
determine all critical points in 
the interior the domain, and 
compare their values with maxima 
or minima at the boundary. 



(A point is in the interior of G, if there is a small disc around (xo, yo) contained in G. A point is at the boundary of G, 
if any disc aound (xo,yo) contains both points in G and in the complement). 



EXAMPLE 5. Find the critical points of f(x, y) = 2x 2 - x 3 - y 2 . With Vf(x, y) = 4x - 3x 2 , -2y), the critical 

r 4 - 6x 1 

points are (4/3,0) and (0,0). The Hessian is H(x,y) = q 2 " ^ ^ e discriminant is —8 so 

that this is a saddle point. At (4/3, 0), the discriminant is 8 and Hn = 4/3, so that (4/3, 0) is a local maximum. 



WHY DO WE CARE ABOUT CRITICAL POINTS? 

• Critical points are candidates for extrema like maxima or minima. 

• Knowing all the critical points and their nature tells alot about the function. 

• Critical points are physically relevant. Examples are configurations with lowest energy). 



A CURIOUS OBSERVATION: (The island theorem) Let f(x,y) 
be the height on an island. Assume there are only finitely many 
critical points on the island and all of them have nonzero de- 
terminant. Label each critical point with a +1 "charge" if it is 
a maximum or minium, and with —1 "charge" if it is a saddle 
point. Sum up all the charges and you will get 1, independent of 
the function. This property is an example of an "index theorem", 
a prototype for important theorems in physics and mathematics. 




CRITICAL POINTS IN PHYSICS, (informal) Most physical laws are based on the principle that the equations 
are critical points of a functional (in general in infinite dimensions). 

• Newton equations. (Classical mechanics) A particle of mass m moving in a field V along a path 7 : t 1— > r(t) 
extremizes the integral 5(7) = mr'(t) 2 /2 — V(r(t)) dt. Critical points 7 satisfy the Newton equations 
mr"{t)/2- W(r(t)) = 0. 

• Maxwell equations. (Electromagnetism) The electromagnetic field (E, B) extremizes the Integral S(E, B) = 
^ J (E 2 — B 2 ) dV over space time. Critical points are described by the the Maxwell equations in vacuum. 

• Einstein equations (General relativity) If g is a dot product which depends on space and time, and R is 
the "curvature" of the corresponding curved space time, then S(g) = J J R dV(g) is a function of g for which 
critical points g satisfy the Einstein equations in general relativity. 

OTHER WAYS TO FIND CRITICAL POINTS. Some ideas: walk in the direction of the gradient until you 
reach a local maximum or walk backwards to reach a local minima. To find saddle points, consider the shortest 
path connecting two local minima and take the maximum along this path. 



3/15/2004 ICE: COBB-DOUGLAS FORMULA 

The Mathematician Charles W. Cobb and the economist 
Paul H. Douglas (picture) found in 1928 empirically 
a formula F(L, K) = bL a K^ which gives the total 
production F of an economic system as a function of 
the amount of labor L and the capital investment K. 

By fitting data, they got b = 1.01, a = 0.75,/? = 0.25. 
By rescaling the production unit we can get 6=1 and 
work with the formula: 

F(L,K) = L^K 1 ^ 

Assume that the labor and capital investment are bound 
by the constraint G(L, K) = L 3 / 4 + K 1 / 4 = 1. Where is 
the production P maximal under this constraint? 
VF(L,K) = 



VG{L,K) = 



Solve: VF{L,K) = \VG(L,K),G(L,K) = 1: 



Lecture 15: 3/15/2004, CONSTRAINED EXTREMA 



Math21a, O. Knill 



Section 11.7: 32 e,f, 40, Section 11.8: 2,6 



CONSTRAINED EXTREMA. Given a function f(x,y) of two 
variables and a level curve g(x,y) = c. Find the extrema of / 
on the curve. You see that at places, where the gradient of / is 
not parallel to the gradient of g, the function / changes when we 
change position on the curve g = c. Therefore we must have a 
solution of three equations 

V/0, y) = XVg(x, y),g(x, y) = c 

to the three unknowns (x,y,X). (Additionally: check points with 
\7g(x,y) = (0,0)). The constant A is called the Lagrange mul- 
tiplier. The equations obtained from the gradients are called 
Lagrange equations. 



EXAMPLE. To find the shortest distance from the origin to the 
curve x 6 + 3y 2 = 1, we extremize f(x,y) = x 2 + y 2 under the 
constraint g(x, y) = x 6 + Sy 2 — 1 = 0. 
SOLUTION. 

V/ = (2x,2y),Vg = (6x 5 ,6y). Vf = AV g gives the system 
2x = X6x 5 , 2y = XQy, x 6 + 3y 2 - 1 = 0. 

A = 1/3, x = x 5 , so that either x = or 1 or —1. From the 
constraint equation, we obtain y = y/Jl — x 6 )/3. So, we have 
the solutions (Q,±y/T/3) and (1, 0), (-1, 0). To see which is the 
minimum, just evaluate / on each of the points. We see that 
(0, ± y/T/3) are the minima. 



HIGHER DIMENSIONS. The above constrained extrema problem works also in more dimensions. For example, 
if f(x, y, z) is a function of three variables and g(x, y, z) = c is a surface, we solve the system of 4 equations 

| V/Qk, y, z) = XVgjx, y, z),g(x, y) = c\ 

to the 4 unknowns (x, y, z, A). In n dimensions, we have n + 1 equations and n + 1 unknowns {x\, . . . , x n , A). 

EXAMPLE. Extrema of f(x,y,z) = z on the sphere g{x 1 y 1 z) = x 2 + y 2 + z 2 = 1 are obtained by calcu- 
lating the gradients Vf(x,y,z) = (0,0,1), Vg(x,y,z) = (2x,2y,2z) and solving (0,0,1) = V/ = XVg = 
(2Xx, 2Xy, 2Xz), x 2 -\-y 2 + z 2 = 1. A = is excluded by the third equation 1 = 2Xz so that the first two equations 
2Xx = 0, 2Xy = give x — 0, y = 0. The 4'th equation gives z = 1 or z = — 1. The extrema are the north pole 
(0,0, 1) (maximum) and the south pole (0,0, —1) (minimum). 



THE PRINCIPLE OF MAXIMAL ENTROPY. Consider a dice showing % with probability p { , where i = 1, 6. 
The entropy of the probability distribution is defined as S(p) = — Y^i=i Pi^°z{Pi)- Find the distribution p 
which maximizes entropy under the constrained g{p) = Y^i=\Pi = 1- 

SOLUTION: V/ = (-1 - log(pi), . . . , -1 - log(p n )), Vg = (1, . . . , 1). The 
Lagrange equations are —1 — log(pi) = A,pi + ... + Pq = 1, from which we get 
Pi = e ~( x+1 \ The last equation 1 = E; ex P(-( A + !)) = 6exp(-(A + 1)) fixes 
A = — log(l/6) — 1 so that pi = 1/6. The distribution, where each event has 
the same probability is the distribution of maximal entropy. 

REMARK. Maximal entropy means least information content. A dice 
which is fixed (asymmetric weight distribution for example) allows a cheat- 
ing gambler to gain profit. Cheating through asymmetric weight distributions 
can be avoided by making the dices transparent. 






THE PRINCIPLE OF MINIMAL FREE ENERGY. Assume that the probability that a system is in a state 
i is Pi and that the energy of the state % is Ei. By a fundamental principle, nature tries to minimize the 
free energy f(pi,...,p n ) = — log(pj) — EiPi when the energies Ei are fixed. The free energy is the 

difference of the entropy S(p) = — YliPi l°g(Pi) an d the energy E(p) = J2i EiPi. The probability distribution 
Pi satisfying YliPi = 1 minimizing the free energy is called the Gibbs distribution. 

SOLUTION: V/ = (-1 - log(pi) -E 1 ,...,-l- log(p n ) - E n ), Vg = (1, . . . , 1). The Lagrange equation are 

log(pi) = — 1 — A — Ei, or pi = exp(—Ei)C, where C = exp(— 1 — A). The additional equation p\ + Yp n = 1 

gives CC i exp(-E i )) = 1 so that C = lAE^"^)- Th e Gibbs solution is p t = exp(-Ei)/ J2i exp(-^). 
For example, if Ei = E, then the Gibbs distribution is the uniform distribution Pi = 1/n. 



MOST ECONOMIC ALUMINUM CAN. You manufacture cylindrical soda cans of height h and radius r. 
You want for a fixed volume V(r, h) = hirr 2 = 1 a minimal surface area A(r, h) = 2irrh + 2-Kr 2 . With 
x = hir,y = r, you need to optimize f(x,y) = 2xy + 2iry 2 under the constrained g(x,y) = xy 2 = 1. Calculate 
Vf(x, y) = (2y, 2x + 4iry), Vg(x, y) = (y 2 , 2xy). The task is to solve 2y = Xy 2 , 2x + Any = X2xy, xy 2 = 1. The 
first equation gives yX = 2. Putting that in the second one gives 2x + 47ry = 4x or 2ny = x. The third equation 
finally reveals 2vn/ 3 = 1 or y = 1/(2tt) 1 / 3 , x = 2tt{2tt) 1 / 3 . This means h = 0.54.., r = 2h = 1.08 
REMARK. Other factors can influence the shape also. For example, the can has to withstand pressure forces 
up to 100 psi. 



WHERE DO CONSTRAINED EXTREMAL PROBLEMS OCCUR, (informal) 

• Constraints occur at boundaries. If we want to maximize a function over a region, we have also to look at the 
extrema at the boundary, where the gradient not necessarily vanishes. 

• In physics, we often have conserved quantities (like for example energy, or momentum, or angular momentum), 
constraints occur naturally. 

• In economical contexts, one often wants to optimize things under constraints. 

• In mechanics constraints occur naturally (i.e. for robot arms). 

• Constrained optimization is important in statistical mechanics, where equilibria have to be obtained under con- 
straints (i.e. constant energy). 

• Probability distributions are solutions of constrained problems. For example, the Gaussian distribution (normal 
distribution) is the distribution on the line with maximal entropy. 

• Eigenvalue problems in linear algebra can be interpreted as constrained problems. Maximizing u ■ Lu under the 
condition \u\ 2 = 1 gives the equation Lu = Am and u has to be an eigenvector. The Lagrange multiplier is an 
eigenvalue. 



DID WE GET ALL EXTREMA? The Lagrange equations 
Vf(x,y) = XVg(x,y),g(x,y) = c do not give all the extrema. 
We can also have S7g(x,y) = (0,0). The parallelity condition 
also could have been written as XVf(x,y) = Vg(x,y), f(x,y) = 
x 2 + {y ~ I) 2 5 9i x , y) — % 2 ~ y 3 - The function / has a local maxi- 
mum 1 at (0, 0) under the constraint g(x, y) = but the Lagrange 
equations do not find it. The problem is that the gradient of g van- 
ishes. Vg is technically parallel to V/ but there is no A such that 
V/ = AV g at this point. The reason for this "mistake" (which 
is present in virtually all calculus text books), is that parallelity 
of the two gradient is not equivalent to V/ = AVg but can also 
mean AV / = Vg with A = 0. 




CAN WE AVOID LAGRANGE? We could extremize f(x,y) under the constraint g(x,y) 
y = y(x) from the later and extremizing the ID problem f(x,y(x)). 



by finding 



EXAMPLE I. To extremize f(x,y) = x 2 + y 2 
with constraint g(x,y) = x 4 + 3y 2 — 1 = 0, solve 
y 2 = (1 — x 4 )/3 and minimize h(x) = f(x,y(x)) = 
x 2 + (1 - x 4 )/3. h'{x) = gives x = 0. The find 
the maximum (±1,0), we had to maximize h(x) on 
f — 1,1], which occurs at ±1. 



Sometimes, 
avoided. 



the Lagrange method can be 



EXAMPLE 2. Extremize f(x,y) = x 2 + y 2 un- 
der the constraint g(x,y) = p(x) + p{y) = 1, 
where p is a complicated function in x which sat- 
isfies p(0) = 0,p'(l) = 2 The Lagrange equations 
2x = Xp'(x),2y = Xp'(y),p(x) + p(y) = 1 can be 
solved: with x = 0,y = 1,X = 1, however, we can 
not solve g(x,y) = 1 for y. Substitution fails: 
In general, the Lagrange method is more pow- 
erful. 



Lecture 16: 4/17/2004, MORE ON EXTREMA 



Math21a, O. Knill 



HOMEWORK: Section 11.8: 10,20,24,42 



MAXIMAL AREA OF RECTANGLE. We want 
to extremize the area of a rectangle for which the 
length of the boundary is fixed 4. If the sides are 
x and y, then we want to extremize f(x,y) = xy 
under the constraint g(x, y) = 2x + 2y = 4. The 
Lagrange equations y = 2A, x = 2A show that 
x = y and so x — y = 1. 



The last problem could also been solved by substituting 
y = 2 — x into the area formula A — xy — x{2 — x) lead- 
ing to a one-dimensional extremal problem: maximize 
f(x) = x(2 — x) on the interval [0, 2]. To do so, we have 
to find the extrema inside the interval and then consider 
also the boundary points x = 0,x = 2. Again, we get 
x = l. 



VOLUME OF CUBE. Extremize the volume f(x, y, z) = xyz of a box with fixed surface area xy + yz + xz = 3. 
To solve yz = X(y + z),xz = X(x + z),xy = X(x + y),xy + yz + xz = 1, take quotients: 
z/x = (y + z)/(y + x),z/y = [z + x)/(y + x) which gives z(y + x) — x(y + z),z(y + x) = y(z + x) so 
that either xz = yz or z = 0. Similarly, we get y = z or y = 0. The solution m x = y = z = 1. 



AN OTHER SOLUTION. For a solution without Lagrange multipliers, we would plug in z = (1 — xy)/(y + x) 
and try to find the maximum of f(x, y) = xy(l — xy)/(y + x) on the domain D = {x > 0, y > 0, xy < 1 }. 

We first would have to find critical points inside the region D: 

fx(x, y) = -2/(1 - 2xy)/(x + y)- xy(l - xy)/(x + y) 2 = 

fy(x, V) = - 2xy)/(x + y) - xy(l - xy)/(x + y) 2 = 

The difference of these two equations gives (x — y)(l — 2xy) = so that either x = y or xy = 1/2. The 
second case can not give us f x = f y = 0. The first condition x = y gives x = j/ = 1 which is not inside the 
region. However, on the boundary g(x,y) = xy = 1, the Lagrange equations V/ = XVg have a solution with 
(z,y) = (l,l). 



The example illustrates the power of Lagrange multipliers. The substitution method is more complicated. 



TWO CONSTRAINTS, (informal) The calculation with Lagrange mul- 
tipliers can be generalized: if the goal is to optimize a function f(x, y, z) 
under the constraints g(x,y,z) = c, h(x,y,z) = d, take the Lagrange 
equations 

pyf = AV(? + /xV/i, g = c ,h = d\ 

which are 5 equations for the 5 unknowns (x, y, z, A, /x). Geometrically 
the gradient of / is in the plane spanned by the gradients of g and h. 
(This is the plane orthogonal to the curve {g = c, h = d}.) 



GENERAL PROBLEM. Given a region G whose boundary is given by g(x : y) = c. The task to maximize or 
minimize f(x,y) on G has the following steps: 

I) Find extrema inside the region: compute critical points V/ = (0, 0) and classify them using the second 
derivative test. 

II) Find extrema on the boundary using Lagrange: V/ = XVg,g = c. 

III) Compare the values of the functions obtained in I) and II) to find the maximum or minimum. 






EXAMPLE. Extremize f(x, y) = 3x 2 -4x-y 2 on the disc x 2 +y 2 < 
1. 

I) Inside the disc. There is only one critical point (2/3,0). The 
discriminant D is —6 so that (2/3,0) is a saddle point. 

II) On the boundary solve 6x — 4 = 2Xx, —2y = 2Xy. There are 
four solutions: (1/2, -y/3/2), (1/2, +V3/2), (1,0), (-1,0). 

III) A list of all candidates: 



(x,y) 


f(x,y) 


(2/3,0) 


-4/3 


(1/2,-^3/2) 


-2 


(1/2,^3/2) 


-2 


(1,0) 


-1 


(-1,0) 


7 



reveals that (—1,0) is the maximum and (1/2, — ± \/3/2) are min- 
ima. 




IN MATHEMATICA. 

Here is how a machine solves the above problem. After defining the functions / and g, the machine solves first 
the equations leading to critical points, and then the Lagrange equations (we put L = X). 

/[#_, y_] := 3x 2 — Ax — y 2 
g[x-,y-} :=x 2 + y 2 -l 

So\ve[{D[f[x,y} : x] == 0, D[f[x, y], y] == 0} : {x : y}} 

So\ve[{D[f[x,y} : x] == L * D\g[x,y],x],D[f[x,y],y] == L * D\g[x,y],y],g[x,y] == 0}, {x, y, L}] 



TRICKY LAGRANGE PROBLEM. Let p and q be positive constants such that 1 + 1 = 1. Use the method of 
Lagrange Multipliers to prove that for any x > 0, y > 0, the following inequality is true: 

x p y q 
xy < — + — . 
v q 

SOLUTION. 

We have to find the maximum of f(x, y) = xy (x > 0, y > 0) under the constraint — + — = c.) 
The Lagrange equations x = A^ p_1 ,?/ = Xy q ~ x gives y/x = x p ~ 1 /y q ~ 1 so that y q = x p . 

Plugging this into x p /q + y q = c gives x p (l/p + 1/q) = c or x = c 1//p and so y = c l / q . The maximal value of 
f(x,y) = xy is c 1 / p c 1 / gr = c 1 = c. Therefore, everywhere 

xy = f(x, y) <c = x p /p + y q /q . 



SNELLS LAW of refraction is the problem to determine the 
fastest path between two points, if the path crosses a border of 
two media and the media have different indices of refraction. The 
law can be derived from Lagrange: 

PROBLEM. A light ray travels from A = (-1,1) to the point 
B = (1,-1) crossing a boundary between two media (air and 
water). In the air (y > 0) the speed of the ray is V\ = 1 (in units 
of speed of light). In the second medium (y < 0) the speed of 
light is V2 = 0.9. The light ray travels on a straight line from A 
to a point P = (x, 0) on the boundary and on a straight line from 
P to B. Verify Snell's law of refraction sin(#i)/ sin(#2) = V1/V2, 
where 6\ is the angle the ray makes in air with the y axis and 
where 62 is the angle, the ray makes in water with the y axis. 



SOLUTION. Minimize f(x,y) = - x) 2 + y 2 / Vl + 

a/(1 — x) 2 + y 2 /v2 = I1/V1 + I2/V2 under the constraint G{x,y) = 
y = 0. The Lagrange equations show that f x (x, y) = 0. This is al- 
ready Snells law because f x = v 1 2(x+l)/(2l 1 )+v 2 2(l-x)/(2l 2 ) = 
means V1/V2 = sin(#i)/ sin(# 2 )- If vi is larger, then B\ is larger. 





V 1 


(-1,1) 




1 \. 






\(x,y) 




\o 




\ 2 


V 2 


(1.-1) 



Lecture 17: 3/19/2004, 2D INTEGRALS 



Math21a, O. Knill 



HOMEWORK. Section 12.1: 16,26,30,44. Section 12.4: 44 



ID INTEGRATION IN 100 WORDS. If f(x) is a continuous function then f(x) dx can be defined as a 
limit of the Riemann sum f n {x) = \ ^x fe e[a b] f( x k) for n ^ oo with Xk = k/n. This integral divided by 
\b — a\ is the average of / on [a, 6]. The integral can be interpreted as an signed area under the graph 
of /. If f(x) = 1, the integral is the length of the interval. The function F(x) = J a f(y) dy is called an 
ant i- derivative of /. The fundamental theorem of calculus states F'(x) = f(x). Unlike the derivative, 
anti-derivatives can not always be expressed in terms of known functions. An example is: F(x) = J* e~ l dt. 
Often, the anti-derivative can be found: Example: f(x) = cos 2 (x) = (cos(2x) + l)/2,F(x) = x/2 — sin(2a?)/4. 



AVERAGES=MEAN. www.worldclimate.com gives the follow- 
ing data for the average monthly rainfall (in mm) for Cam- 
bridge, MA, USA (42.38 North 71.11 West, 18m Height). 
Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec 
93.9 88l3 83^3 671) 4Z9 26^4 2T9 83^8 35^5 6L4 166.8 82^8" 
The average 860.3/12 = 71.7 is a Rieman sum integral. 



2D INTEGRATION. If f(x,y) is a continuous function of two variables on a region R, the integral 
J R f(x,y) dxdy can be defined as the limit Xi g.r /(^> w ^h Xij = (i/n,j/n) when n goes to 

infinity. If f(x,y) = 1, then the integral is the area of the region R. The integral divided by the area 
of R is the average value of / on R. For many regions, the integral can be calculated as a double 
integral [f^) f(x, y) dy]dx. In general, the region must be split into pieces, then integrated seperately. 






One can interpret j J R f(x, y) dydx as the volume of solid below the graph of / and above R in the x — y plane. 
(As in ID integration, the volume of the solid below the x-y plane is counted negatively). 



EXAMPLE. Calculate / f R f(x,y) dxdy, where f(x,y) = 4x 2 y 3 and where R is the rectangle [0, 1] x [0,2]. 

f [[ 4x 2 y 3 dy]dx = [ [x 2 y 4 \ 2 ] dx = [ x 2 {16 - 0) dx = 16x 3 /3\l = ^ . 
Jo Jo Jo Jo ^ 



FUBINFS THEOREM. 


fa fc f( x iV) dxd V = fc fa f(V' x ) d V dx - 







TYPES OF REGIONS. 

Ilnf dA = £K$f(*,v) d V d * 
type I region. 

//«/ dA = £j£$f(x,v) dxdy 
type II region. 





EXAMPLE. Let R be the triangle 1>£>0, 1 > y > 0,y < x. Calculate 
J J R e~ x2 dxdy. 

ATTEMPT. Jglfy e~ x2 dx]dy. We can not solve the inner integral because e~ x ^ 
has no anti-derivative in terms of elementary functions. 



Jo [Jo 



dx = J Q xe 



dx 



IDEA. Switch order: 
0.316.... 

A special case of switching the order of integration is Fubini's theorem. 



If you can't solve a 
double integral, try to 
change the order of in- 
tegration! 



QUANTUM MECHANICS. In quantum mechanics, the motion of a particle (like an electron) in the plane is 
determined by a function u(x,y), the wave function. Unlike in classical mechanics, the position of a particle 
is given in a probabilistic way only. If R is a region and u is normalized so that J \u\ 2 dxdy = 1, then 
j R \u(x,y)\ 2 dxdy is the probability, that the particle is in R. 

EXAMPLE. Unlike a classical particle, a quantum particle in a box [0,7r] x [0,7r] can have a discrete set of 
energies only. This is the reason for the name "quantum". If — (u xx + u yy ) = Au, then a particle of mass m has 
the energy E = Xh 2 /2m. A function u(x, y) = sm{kx) sin(m/) represents a particle of energy (k 2 + n 2 )% 2 / (2m). 
Let us assume k = 2 and n = 3 from now on. Our aim is to find the probability that the particle with energy 
13h 2 /{2m) is in the middle 9'th R = [7r/3,27r/3] x [7r/3,27r/3] of the box. 
SOLUTION: We first have to normalize u 2 (x,y) = sin 2 (2a;) sin 2 (Sy), so 
that the average over the whole square is 1: 



Jo Jo 



(2x) sin (3y) dxdy . 



To calculate this integral, we first determine the inner integral 
Jq sin 2 (2x) sin 2 (3?/) dx = sin 2 (3y) J™ sin 2 (2a?) dx = |sin 2 (3?/) (the fac- 
tor s'm 2 (3y) is treated as a constant). Now, A = Jq(tt/2) sin 2 (3y) dy = 
^j-, so that the probability amplitude function is f(x,y) = 
^sm 2 (2x) sm 2 (3y). 

The probability that the particle is in R is slightly smaller than 1/9: 

4 ,2tt/3 ,2tt/3 

= — I / sin 2 (2x) sin 2 (3y) dxdy 

7T 2 Jn/3 Jn/3 

= ^{4x - sin(4a;))/8| 2 r // 3 (6a; - sin(6x))/12|^/ 3 



\Jj( x iy) dxd y 



= 1/9- l/(4\/37r) 
The probability is slightly smaller than 1/9. 




WHERE DO DOUBLE INTEGRALS OCCUR? 

- compute areas. 

- compute averages. Examples: average rain fall or average population in some area. 

- probabilities. Expectation of random variables. 

- quantum mechanics: probability of particle being in a region. - find moment of inertia j j R (x 2 -\-y 2 )p(x,y)dxdy 

- find center of mass (j f R xp(x,y) dxdy/M,J J R yp(x,y) dxdy/M), with M = j j R dxdy. 

- compute some ID integrals. 



TRIPLE INTEGRALS are defined similarly and covered later in detail. 
j J J R 1 dxdydz is a volume. 



Fubinis theorem generalizes. 



Lecture 18: 7/22/2003, APPLICATIONS 



Math21a, O. Knill 



HOMEWORK. Section 12.2: 26,36,44, Section 12.5: 14 



THINGS TO KEEP IN MIND. 

• Double integrals can often be evaluated 
through iterated integrals 



" Integrals have layers" . 



• JJ R 1 dxdy = JJ 1 dA is the area of the 
region R. 

• JJ R f(x,y) dxdy is the volume of the solid 
having the graph of / as the "roof and the R 
in the xy— plane as the "floor". 




TYPES OF REGIONS. 

/ J R f dA = J a J c f(x, y) dydx rectangle. 

J J R f dA = fjj*W f(x, y) dydx type I region. 
JJ R fdA = J b a f(x, y) dxdy type II region. 

A general region, we try to cut it into pieces, where each piece is a Type I or Type II region. 




Rectangle 



Type I 



Type II 



To cut 



AREA A = JJ R 1 dA VOLUME V = JJJ R 1 dV 

MASS M = ff R p{x, y) dA MASS M = JJ R p(x, y, z) dV 

AVERAGE JJ R f(x, y) dA/A. AVERAGE Jff R f(x, y, z) dV/V. 

CENTROID CSS R x dA/A, JJ R y dA/A). CENTROID (JJJ R x dV/V, JJJ R y dV/V, JJJ R z dV/V). 

CTR MASS (ff R xp(x, y)dA, JJ R yp{x, y)dA)/M. COM. {JJJ R xpdV, JJJ R xpdV, JJJ R ypdV)/M. 

MOMENT OF INERTIA I = ff R (x 2 + y 2 ) dA MOMENT OF INERTIA I = $ff R {x 2 + y 2 ) dV 

RADIUS OF GYRATION y/T/M RADIUS OF GYRATION \/IJM 



AREA OF CIRCLE. To compute the area of the circle of radius r, we integrate 



-LC 



dydx . 



The inner integral is 2\/r 2 — x 2 so that 



rr 

A= 2 \Jr 2 — x 2 dx . 

This can be solved with a substitution: x = rs'm(u),dx = rcos(w). With the new bounds a = — 7r/2,6 = tt/2 



and \Jr 2 — x 2 = yj 1 r 2 — r 2 sin 2 (w) = rcos(w) we end up with 



2r 2 cos 2 (w) du= r 2 (l + cos(2w)) du = v 2 tt . 

J-n/2 



MOMENT OF INERTIA. Compute the kinetic energy of a square iron plate R = [-1, 1] x [-1, 1] of density p = 1 
(about 10cm thick) rotating around its center with a 6'000rpm (rounds per minute). The angular velocity speed 



is u = 2tt e'000/60 = 100 2tt. Because E = J J R {rcu) 2 /2 dxdy, where r = ^ x 2 + y 2 , we have E = uj 2 I/2 
where / = J J R (x 2 + y [ 

cj 2 r'~ • ' ;,: " > ' 



dxdy is the moment of inertia. For the square, 1 = 8/3. Its energy of the plate is 
= 47r 2 100 2 8/6 Joule ~ 0.86KWh. You can run with this energy a 60 Watt bulb for 14 hours. 



A FLOWER. What is J J R x 2 + y 2 dxdy, where R is a flower obtained by ro- 
tating the region enclosed by the curves y = x 2 and y = 2x — x 2 by adding 
multiples of the angles 27r/12? 

SOLUTION. The moment of inertia of all the petals add up: / = 
12 £ i 2 x r x \x 2 + y 2 ) dydx = J^[x 2 y + y z 1^=^ dx = 1243/210 = 86/35. 



* 



PROBLEM: BERTRAND'S PARADOX (Bertrand 1889) 

We throw randomly lines onto the disc. What is the probability that the 
intersection with the disc is larger than the length \/3 of the equilateral triangle 
inscribed in the unit circle? 

Answer Nr 1: take an arbitrary point P in the disc. The set of lines which 
pass through that point is parametrized by an angle 4>. In order that the chord 
is longer than \/3, the angle has to fall within an angle of 60° of a total of 180°. 
The probability is 1 1/3 L 





Answer Nr 2: consider all lines perpendicular to a fixed diameter. The chord 
is longer than \/3, when the point of intersection is located on the middle half 
of the diameter. The probability is 1 1/2 | . 



Answer Nr. 3: if the midpoint of the intersection with the disc is located in 
the disc of radius 1/2 with area 7r/4, then the chord is longer than -\/3- The 
probability is 1/4 . 



The paradox comes from the choice of the probability density function f(x,y). In each case, there is a 
distribution function f(x,y) which is radially symmetric 
The constant distribution f(x,y) = l/ir 



is obtained when we throw the center of the line into the disc. The 



disc A r of radius r has probability r /tt. The density in the r direction is 2r. 
The distribution 



f(x, y) = 1/r is obtained when throwing parallel lines. This will put more weight to center. 



The disc A r of radius r has probability of A r is bigger than the area of A r . The density in the r direction is 
constant equal to 1. 

Lets compute the distribution when we rotate the line around a point at the boundary. We hit a disc A r of 
radius r with probability F(r) = arcsin(r)2/7r. The density in the radial direction is f(r) = 2/ (7Vy/(l — r 2 )) 



COMPARISON OF THE DEN- 
SITY FUNCTIONS. A plot of the 
radial distribution functions f(r) as 
well as F(r), the probability of A r 
shows why we get different results 
for F(l/2). 





What happens if we really do an experiment and throw randomly lines onto a disc? The outcome of the 
experiment will depend on how the experiment will be performed. If we would do the experiment by hand, 
we would probably try to throw the center of the stick into the middle of the disc. Since we would aim to the 
center, the distribution would probably be different from any of the three solutions discussed above. 



STATISTICS. If f(x,y) is a probability distribution on R: f(x,y) > 0, JJ R f{x,y) dA = 1, then E[X] = 
J X(x,y)f(x,y) dA is called the expectation of X, Var(X) = E[(X — E[X]) 2 ] is called the variance and 
<j{X) = Y^(Var[X]) the standard deviation. 



Lecture 19: 3/24/2004, POLAR INTEGRATION 



Math21a, O. Knill 



HOMEWORK. Section 12.3: 10,20,32,42 



POLAR COORDINATES. A point (x,y) 
in the plane has the polar coordinates 

r = \J x 1 + y 2 , 9 = axo,tg{y/x). We have 
x = rcos(#), y = rsin(0). 

Footnote: Note that 9 = arctg(y/x) defines the angle 
6 only up to an addition of 7r. The points (x,y) and 
(— x, — y) would have the same 6. In order to get the 
correct 6, one could take arctan(y/x) in (— 7r/2,7r/2] as 
Mathematica does, where it/2 is the value when y/x = 
oo, and add it if x < or x = 0, y < 0. In Mathematica, 
you can get the polar coordinates with 
(r,0) = (Abs[x + Jy],Arg[x + Jy]). 



POLAR CURVES. A general polar curve is written as (r(£), #(£)). It can be translated into x,y coordinates: 
x(t) = r(t) cos(0(£), y(t) = r(t) sin(0(*)). 

POLAR GRAPHS. Curves which are graphs when written in polar coordinates are called polar graphs. 
EXAMPLE. r(9) = cos(3#) is the which belongs to the class of roses r(t) = cos(nt). 

EXAMPLE. If y = 2x + 3 is a line, then the equation gives rsin(#) = 2rcos(#) + 3. Solving for r(t) gives 
r(9) = 3/(sin(0 — 2cos(0)). The line is also a polar graph. 



EXAMPLE. The polar form r{9) - 



of the ellipse (see Kepler). The ellipse is a polar graph. 



POLAR COORDINATES. For many regions, it is better to use polar coordinates for integration: 



/ Ir f( x ' y) dxd v = I Ir #( r ' e ) r drde 



For example if f(x, y) = x 2 + x 2 + xy, then g(r, 9) = r 2 + r 2 cos(9) sin(0). 



EXAMPLE. We had computed area of the disc {x 2 + y 2 < 1} using substitution. Was quite a mess. It is easier 
to do that integral in polar coordinates: 



/ / r drd9 = 27rr 2 /2\l = 7T . 
Jo Jo 



WHERE DOES THE FACTOR "r" COME FROM? 



A small rectangle R with dimensions 
d9dr in the (r, 9) plane is mapped by 
T:(r,0) 

(rcos(#),rsin(#)) to a sector segment 
S in the (x, y) plane. It has approxi- 
mately the area rd9dr. It is small for 
small r. 




AN OTHER EXPLANATION (for people who have seen some linear algebra). The map translating from polar 
coordinates to Cartesian coordinates (x,y) = T(r,9) = (r cos(0), r sin(0)) has as a linear approximation the 
matrix 

cos(#) sin(0) 
-rsin(#) rcos(#) 



DT(x,y)-- 



which has determinant r. A small rectangle in the (r, #)-plane of area dA will have in the (x, y) plane the area 
rdA. 



ROSES. We can now integrate over type I or type II regions in the (9, r) plane. Examples are roses: 
{{9,r) |0 < r < f(9)} where f(9) is a periodic function of 9. 





The region R in the 9 — r coordinates is a type 
I region 



The region S = T{R) in the x — y coordinates 
is neither a type I nor a type II region. 



EXAMPLE. Find the area of the region {(0,r(0)) | r(0) < | cos(30)| }. 

' ' cos(30) 2 



f f y dxdy = f " f° S rdrd9= T cos ^ 6 L d9 = w / 2 
J J r Jo Jo Jo 2 



EXAMPLE. Integrate f(x, y) = y\Jx 2 + y 2 over the semi annulus R — {(x, y) \ 1 < x 2 + y 2 < 4, y > }. 

Solution. 2 2 44 

( [ rsm(0)rrdOdr= [ r 3 [ sin(0) d9dr = ^ ~ 1 ^ [ sin{9)d9 = 15/2 
Ji Jo Ji Jo 4 J 

For integration problems, where the region is part of an annulus, or if you 
see function with terms x 2 + y 2 try to use polar coordinates x = rcos(9),y = 
r sin(0). 



THE SUPER-CURVE. The Belgian Biologist Johan Gielis 
came up in 1997 with the family of curves given in polar 
coordinates as 



r(0): 



|cos(^)p 



[ ^ r j i| 
:=Abs[Sin[ni t/4]/b]"n2; 
= (rl [t]+r2 [t] )-(i/n3); 

= If [R[t] ==0,(0,0] ..{Cos [t] r Sin[t] )/R[t]]; 
5tricPlot[r [t] , (t,-2Pi,2Pi),PlotRange->All, 



:eger, {3,20]]; RR:=20 R 



It is called the super-curve because it can produce a vari- t-U\ 
ety of shapes like circles, square, triangle, stars. It can also 
be used to produce "super-shapes" (see later). 

The super-curve generalizes the superellipe which had been discussed in 1818 by Lame and helps to tackle 
one of the more intractable problems in biology: describing form. A twist: Gielis has patented his discovery 
described in "Gielis, J. A 'generic geometric transformation that unifies a wide range of natural and abstract 
shapes'. American Journal of Botany, 90, 333 - 338, (2003). To the right you see the Mathematica code. 



4/26/2004, 3D INTEGRATION 



Math21a, O. Knill 



3D INTEGRATION. If f(x,y,z) is a function of three variables and R is a region in space, then 
J J J R f(x, y, z) dxdydz is defined as the limit of Riemann sum ^ J2(xi,yj,z k )eR f( x i> Vh z k) f° r n ~^ °°> where 
(x i ,y j ,z k ) = (i,i,$). ^ 

TRIPLE INTEGRALS. As in two dimensions, triple integrals can be evaluated through iterated ID integrals. 
EXAMPLE. Assume R is the box [0, 1] x [0, 1] x [0, 1] and f(x,y, z) = 2Ax 2 y 3 z. 



Jq 1 2Ax 2 y 3 z dx dz . 

CALCULATION. We start from the core J* 24x 2 y 3 z dz = 12x 3 y 3 , then integrate the middle layer: 



J Q 12x y dy = 3x and finally handle the outer layer: J Q 3x dx =1. 

WHAT DID WE DO? When we calculate the most inner integral, we fix z and y. The integral is the average of 
f(x,y,z) along a line intersected with the body. After completing the second integral, we have computed the 
average on the plane z = const intersected with R. The most outer integral averages all these two dimensional 
sections. 

VOLUME UNDER GRAPH. The volume under the graph of a function f(x,y) and above a region R 
[a, 6] x [c,d\ is the integral f(x,y) dxdy. Actually, this is a triple integral: 

/b rd rf(x,y) 
J J 1 dzdxdy . 

An integral of the form fg^y) f( x ' ^' z ) dzdxdy is sometimes also called a type I tipple integral. 

VOLUME OF THE SPHERE. (We will do this more elegantly later). The volume is 

r r r r 1 rVi-x 2 ^\-x 2 -y 2 
V= dxdydz = / [/ [/ dz]dy]dx 

J J JR J -I J-Vl-x 2 J-y/l-x 2 -y 2 

After computing the inner integral, we have V = 2 J^-J/_^/^r^(l — x 2 — y 2 ) x / 2 dy\dx. 

To resolve the next layer, call 1 — x 2 = a 2 . The task is to find f" a \/ a 2 — x 2 dx. Make the substitution 

x/a = sm(u),dx = acos(w) to write this as a j o arcsm ( a / a ) ^1 — sm 2 (u)acos(u)du = a 2 2 cos 2 (u)du = a 2 ir/2. 
We can finish up the last integral 

V = 2ir/2 J (1 - x 2 ) dx = 4tt/3 . 

MASS OF A BODY. In general, the mass of a body with density 
p(x, y, z) is J J J R p(x, y, z) dV . For bodies with constant density p 
the mass is pV, where V is the volume. Compute the mass of a body 
which is bounded by the parabolic cylinder z = 4 — x 2 , and the planes 
x — 0, y — 0, y = 6, z — if the density of the body is 1. 



/ / / dz dy dx = / (4 - x 2 ) dydx 
Jo Jo Jo Jo Jo 

6 f (4 - x 2 ) dx = 6(4ic - x 3 /3\ 2 = 32 
Jo 




Kjtiirs i Hiix ur ivi/\oo. Lyompute tne center or 
(24/32, 96/32, 256/180) = (3/4, 3, 8/5): 


mass of the same 


body. The center of mass is 


rZ no — X rZ rO 

/ / / x dz dy dx = / x(4 — x 2 ) 
Jo Jo Jo Jo Jo 


dydx = 6 / x(4 — x 2 ) 
Jo 


drr = 24rr 2 /2-6a; 4 /4|o = 24 


/ / / ydzdydx= / y(4 - x 2 ) 
Jo Jo Jo Jo Jo 


r 2 

dydx= / 18(4 - x 2 
Jo 


dx = 18{4x - x 3 /3)\l = 96 


p2 pQ p4 — x 2 p2 pG 

/ / / zdzdydx= / (4 - x 2 ) 2 /2 dydx 
Jo Jo Jo Jo Jo 


= 6 f ! (4 - x 2 ) 2 12 dx 
Jo 


= 3{16x - 8x 3 /3 + x 5 /5)|o = 256/5 



SOME HISTORY OF COMPUTING VOLUMES. How did people come up calculating the volume 
j J J R 1 dxdydz of a body? 



Archimedes ((-287)-(-212): Archimedes's method of integration al- 
lowed him to find areas, volumes and surface areas in many cases. His 
method of exhaustion paths the numerical method of integration by 
Riemann sum. The Archimedes principle states that any body sub- 
merged in a water is acted upon by an upward force which is equal to the 
weight of the displaced water. This provides a practical way to compute 
volumes of complicated bodies. 

Cavalieri (1598-1647): Cavalieri could determine area and volume 
using tricks like the Cavalieri principle. Example: to get the volume 
the half sphere of radius R, cut away a cone of height and radius R from 
a cylinder of height R and radius R. At height z, this body has a cross 
section with area R 2 ir — r 2 ir. If we cut the half sphere at height z, we 
obtain a disc of area (R 2 — r 2 )ir. Because these areas are the same, the 
volume of the half-sphere is the same as the cylinder minus the cone: 
irR 3 - ttR 3 /3 = 2irR 3 /3 and the volume of the sphere is AttR 3 /3. 

Newton (1643-1727) and Leibniz( 
1646-1716): Newton and Leibniz, 
developed calculus independently. The 
new tool made it possible to compute 
integrals through " anti-derivation" . 
Suddenly, it became possible to find 
integrals using analytic tools. 



MONTE CARLO COMPUTATIONS. Here is an other way to compute integrals: Suppose we want to calculate 
the volume of some body R inside the unit cube [0, 1] x [0, 1] x [0, 1]. The Monte Carlo method is to shoot 
randomly n times onto the unit cube and count the fraction of times, we hit the solid. Here is an experiment 
with Mathematica and where the body is one eights of the unit ball: 

R := RandomQ; k = 0; Do[x = R; y = R; z = R; If[x 2 + y 2 + z 2 < 1, k + +], {10000}]; k/10000 

Assume, we hit 5277 of n=10000 times. The volume so measured is 0.5277. The actual volume of 1/8'th of the 
sphere is tt/6 = 0.524. For n — > oo the Monte Carlo computation gives the actual volume. 

WHERE CAN TRIPLE INTEGRALS OCCUR? 

• Calculation of volumes V = J J J R 1 dV and masses M = J J J p dV. 

• Finding averages J J J R f dV/ J J J R 1 dV. Examples: average algae concentration in a swimming pool. 

• Determining probabilities. Example: quantum probability J J f R f(x,y, z) 2 dxdydz. 

• Moment of inertia J J J R r(x, y, z) 2 p(x, y, z) dV, where r(x : y : z) is the distance to the axis of rotation. 

• Center of mass (/ / J R xp dV/Mj J f R yp dV/M, f J J R zpdV/M). 






3/26/2004, CYLINDRICAL COORDINATES 



Math 21a, O. Knill 



REMINDER: INTEGRATION POLAR COORDINATES. 



EXAMPLE 1. Area of a disk of radius R 



ft 

Jo Jo 



r dOdr = 27Ty|f = R 2 ir . 



WHERE DOES THE FACTOR "r" COME FROM? 

1. EXPLANATION. A small rectangle with dimensions dOdr in the (r, 0) plane is mapped to a sector segment 
in the (x, y) plane. It has approximately the area rdOdr. It is small for small r. 

2. EXPLANATION. The map (r,0) i-> (r cos(0,r sm(0))(f(r,0), g(r,0)) which changes from Cartesian to polar 

coordinates has the Jacobean is T' = C °. S ^fi x S * n ^L = \ ^ r ^ with determinant f r qe — feQr = r. 

[ rsm(0) rcos(#) J [ g r g e J J y J y 

This is a special case of a more general formula which we do not cover in this class. 



T : (r,0,z) 
r cos(6>), r sin(0), z) 




CYLINDRICAL COORDINATES. Use polar coordinates in the x-y 
plane and leave the z coordinate. Take T(r,0,z) = (r cos(0), r sin(0), z). 
The integration factor r is the same as in polar coordinates. 



/ / / f{x,y,z) dxdydz = iff g(r 1 1 z)\v]drd0dz . 
J J JT^FV) j j j r 

For example, if f(x, y, z) = (x 2 + y 2 ) + xz, then g(r, 0, z) = r 2 + 
r cos(0)2;. 




EXAMPLE. Calculate the volume bounded by the parabolic z = 1 — (x 2 + ?/ 2 ) and the rr — ?/ plane. In cylindrical 
coordinates, the paraboloid is given by the relation z = 1 — r 2 : 



/7T 

JO JO JO 



- dzd0dr 



rr 

Jo Jo 



(r - r 3 ) «r = 2vr(r 2 /2 - r 4 /4)J = tt . 



USE A GOOD PICTURE! A good conceptual picture not only helps to solve double and triple integral problems. 
Sometimes, it is even virtually impossible to solve the problem without having a good picture. 



PROBLEM. Find the volume of the solid obtained by taking a sphere 
x 2 + y 2 + z 2 = 1 into which a hole x 2 + y 2 < 1/2 has been drilled. 



SOLUTION. 



Jl/2 J-Vl^ 



rdzdr = 2ir 
1/2 J-Vi-r 2 Ji/: 



which is 



-27r-(l-r 2 )- , 1/2 



f 2r/ 

Jl/2 



1 — r 2 dr 



3 8 



PROBLEM. Find the volume of the intersection of the three cylinders 
x 2 + y 2 < 1, x 2 + z 2 < 1 and y 2 + z 2 < 1. 



SOLUTION. 



W4 ^1 

8 / / yjl-r 2 sin(#) 2 rdrd0 
J-tt/4 Jo 

which is -16/3 + 8x72. 




PROBLEM. Find the volume of the intersection of the two solid 
cylinders x 2 + y 2 < 1 and x 2 + z 2 < 1. 

SOLUTION. We compute the volume in one of the 8 octants and 
multiply by 8 in the end. 



8 / f yjl - r 2 cos(0) 2 rd0dr = 16/3 . 
Jo Jo 

Here is how one would evaluate this integral with Mathematica: 

8 Integrate [Sqrt[l - r 2 Cos[theta] 2 ]r, {r, 0, 1}, {theta, 0, Pi/2}] 

Note the order in which the integration range in entered the computer 
algebra system! 



PROBLEM. Find J J J R z 2 dV, where R of the solid obtained by 
intersecting {1 < x 2 + y 2 + z 2 < 4} with the double cone {z 2 > x 2 -\-y 2 }. 

SOLUTION. We split the integral up into a "cone part" z G [-V%, V%], 
and the cup part \z\ > y/2 and evaluate each seperatly. The double 
cone has the volume 2tt §^/2 §o r ^ r< ^ z = 27r\/2/3. One cup has the 
volume 2tt §q^~ Z r drdz = it J^i^ — z 2 ) dz. The total volume is 
tt(2 - \/2)16/3. 



4/5/2004, SPHERICAL INTEGRALS 



Math21a, O. Knill 



HOMEWORK: Section 12.6: 14,18,32,34,48,62 

REMINDER: POLAR AND CYLINDRICAL COORDINATES. 



J J f(r,6)\r}Wdr . 



Cylindrical coordinates are obtained by taking polar coordinates in 
the x-y plane and leave the z coordinate. With T(r, 9, z) = 
(r cos(0), r sin(#), z). the integration factor r is the same as in polar 
coordinates. 




/ / / f(x,y,z) dxdydz = iff g{r,9, z) \v]drd9dz 

J J JTiR) J J J Ft 



SPHERICAL COORDINATES. Spherical coordinates use p, the distance to the origin as well as two angles: 
the polar angle and 0, the angle between the vector and the z axis. The coordinate change is 

T : (x, y, z) = (pcos(#) sin(0), ps'm(9) sin(0), pcos(0)) . 

The integration factor can be seen by measuring the volume of a spher- 
ical wedge which is dp, psin(0) d9, pdcf) = p 2 sin(( 



/ / / f (x,y, z) dxdydz = / / / g{p,@,z) p 2 sin(0) dpd9dcf) 
J J JT{R) J J Jr 1 1 




■?4 



T:(p,9,<j>) 

(pcos(#) sin(0), 
psin(#) sin(0), 

PCOS(0)) 




VOLUME OF SPHERE. A sphere of radius R has the volume 

/ / / p 2 sin(0) dcf)d9dp . 
Jo Jo Jo 



The most inner integral J™ p 2 sin(0)d0 = — p 2 cos(0)|o = 2p 2 . 
The next layer is, because 4> does not appear: J Q 277 2p 2 d(f> = iirp 2 . 
The final integral is 47rp 2 dp = 4ttR 3 /3. 





MOMENT OF INERTIA. The moment of inertia of a body 
G with respect to an axis L is defined as the triple integral 
/ / Jg 7 *^'^' 2 ) 2 dzdydx, where r(x,y,z) = Rs'm((p) is the 
distance from the axes L. For a sphere of radius R we 
obtain with respect to the z-axis: 



Jo Jo Jo 



sin 2 (0)p 2 sin(0) d(f)d9dp 



r R ,2j 

/ P A dr){ 
Jo Jo 



- i 5! 2tt - SirR5 - ^5- 
~ 3 " ~5~ ' n ~ 15 ~ ~T~ 

If the sphere rotates with angular velocity uj, then Iu 2 /2 
is the kinetic energy of that sphere. Example: the mo- 
ment of inertia of the earth is 810 37 kgm 2 . The angular 
velocity is uj = 1/day = l/(86400s). The rotational en- 
ergy is 810 37 /cpm 2 /(7464960000s 2 ) - 10 28 J = 10 25 /cJ - 
2.510 24 A:caL 

How long would you have to run on a treadmill to accumulate this energy if you could make 2'500 kcal/hour? 
We would have to run 10 21 hours = 3.6 1 24 seconds. Note that the universe is about 10 17 seconds old. If all the 
6 million people in Massachusetts would have run since the big bang on a treadmill, they could have produced 
the necessary energy to bring the earth to the current rotation. To make classes pass faster, we need to spin 
the earth more and just to add some more treadmills ... To the right you see a proposal for the science center. 




DIAMOND. Find the volume and the center of mass of a diamond, the 
intersection of the unit sphere with the cone given in cylindrical coordinates 
as z = V3r. 



Solution: we use spherical coordinates to find the center of mass {x,y, 

V = fo iT fo /6 P 2 sin (^) d ^ dM P = {± ^ 1<1 ' K y = Jo J 2n J n/6 p 3 sin 2 (0) sin(0) d^dp/V = 

* = Jo C Io /6 P 3 sin2 (^) cos (#) d(pdOdp/V = 5 = j Q i j Q 2- j-/6 p3 cqs ^ gin( ^ d(f)d9dp/v = 



PROBLEM Find J J J R z 2 dV for the solid obtained by inter- 
secting {1 < x 2 +y 2 +z 2 < 4} with the double cone {z 2 > x 2 +y 2 }. 

Solution: since the result for the double cone is twice the result 
for the single cone, we work with the diamond shaped region R in 
{z > 0} and multiply the result at the end with 2. 
In spherical coordinates, the solid R is given by 1 < p < 2 and 
< cf) < 7r/4. With z = pcos(0), we have 

jf C IT pi cos2 (0) sin w d * d8dp = ( ¥ " ¥ )2 ' r( ZC T M | ° / 




The result for the double cone is 4tt(31/5)(1 - 1/V2' 
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We first calculate the volume of a sphere of radius R in different ways. Then we show how to calculate the 
volume of the torus in three different ways. The page serves more as an illustration for the variety of tools 
which are available. We did not cover all the material (yet) to understand all of the methods. 



SEVEN WAYS TO COM- 
PUTE THE VOLUME OF 
THE SPHERE 




1) IN RECTANGULAR COORDINATES. The volume is 



f f f f L rVL 2 -x 2 py / L 2 -x 2 -y 2 

V = / / / dxdydz = / dzdydx 

J J JR J-L J-^L 2 -x 2 J -JL 2 -x 2 -y 2 



After computing the most inner integral, we have V = J^ L /^/pz^ 2(L 2 — x 2 — y 2 ) 1 ^ 2 dydx. Define a 2 = L 2 — x [ 
and use the integral in the box below: 

V = 7T J (L 2 - x 2 ) dx = 2ttL 3 - 2ttL 3 /3 = 4ttL 3 /3 . 



Substitution ^ — sin(w),<±c = acos(u)du gives: 


S- a Va 2 — x 2 dx 


— I-i 2 /2 a y 1 — sin 2 (w) a cos (w) du = a 2 J^ 2 2 cos 2 (u)du = 


a 2 ir/2 





2) IN CYLINDRICAL COORDINATES. At height z, we parameterize a disc of radius L 2 - z 2 , so that the 
integral is 

/L p\/ Li 2 —Z 2 pI'K pL 
/ / rdOdrdz = (L 2 - z 2 )2irdz = 4ttL 3 - 2ttL 3 /3 = 4ttL 3 /3 . 

-L JO JO J-L 

3) IN SPHERICAL COORDINATES. 

pR p2TT pTT pR pTT 

/ / p 2 sin(0) d(f>d0dp = 2tt / p 2 dp sin(0) d<j> = 4ttL 3 /3 . 

Jo Jo Jo Jo Jo 



4) WITH CAVALIERI. Cavalieri cuts the hemisphere at height z to obtain a disc of radius \/L 2 — z 2 with area 
tt(L 2 — z 2 ). He looked now at the complement of a cone of height L and radius L which when cut at height z 
gives a ring of outer radius L and inner radius z. The ring has area tt(L 2 — z 2 ). Cavalieri concludes (Cavalieri 
principle) that the volume of that body is the same as the volume of the hemisphere. Since the difference of 
the volume of the cylinder and the cone which is ttL 3 — 7rL 3 /3 the hemisphere has the volume 2ttL 3 /3 and the 
sphere has volume 4ttL 3 /3. 



5) CAS. Integrate^, {x, —L, L}, {y, -Sqrt[L 2 - x% Sqrt[L 2 - x 2 ]}, {z, -Sqrt[L 2 - x 2 - y 2 ], Sqrt[L 2 - x 2 - y 2 ]}] 



6) IN LAS VEGAS. The Monte Carlo Method is to shoot randomly onto the cube [-L, L] x [-L, L] x [-L, L] 
and see how many times we hit the sphere. Here an experiment with Mathematica: 

R := (2Random[] - 1); k = 0; Do[x = R; y = R; z = R; If[x 2 + y 2 + z 2 < 1, k + +], {10000}] ; k/10000 

Assume, we hit 5277 of 10000 the measured fraction of the volume of the sphere with the volume of the cube 8 
is 0.5277. The volume of 1/8'th of the sphere is tt/6 = 0.524 



7) USING GAUSS THEOREM (see later) The vector field F(x, y, z) = (x, y, z) has divergence 3 Gauss theorem 
tells that 3V is the flux of the vector field through the surface which is L times the surface area 47rL 2 . Therefore, 
V = 4ttL 3 /3 



FIVE WAYS TO COM- 
PUTE THE VOLUME OF 
THE TORUS 




1) WITH TORAL COORDINATES. 

T(r, 9, cp) = (x, y, z) = ((6 + r cos(0)) cos(0), (6 + r cos(0)) sin(^), r sin(0)) parameterizes the torus. 

The Jacobean is det(T') = g^'g'^j = r(6 + rcos(0). The torus is the image of the cube [0, a] x [0,27r] x [0, 27r] 
under the map T. The change of variables formula gives 



i(( r(b + r cos(0)) d$d9dr = (27r)(27r) f br dr -- 
Jo Jo Jo Jo 



2) USING CYLINDRICAL COORDINATES. 
If we fix the z coordinate, we obtain an annulus with inner radius b — \J a 2 — z 2 and outer radius b + y/a 2 



This annulus has the area 7r(6+ v 'a 2 — z 2 ) 2 — 7r(6 — \/a 2 — z 2 ) 2 . Therefore, the volume is 47T& J" a \/a 2 — z 2 dz = 
47r6(7ra/2) = 27r 2 a 2 6. 



3) USING PAPPUS CENTROID THEOREM. "The volume of a solid of revolution generated by the revolution 
of a region S in the x — z plane around the z axis is equal to the product of the area of S and the arc length 
27T& of the circle on which the center of S moves" . 

In the case of the torus, the length of the curve is 2irb. The area of the lamina is A = ira 2 . Therefore, the 
volume is 2-K 2 a 2 b. 

PROOF OF THE CENTROID THEOREM. We use a coordinate change transformation. In Polar coordinates, 
the lamina S with center of mass (6, c) is parametrized by r and z. Introduce new coordinates T(u, v) = (u+b, v+ 
c) = (r, z) so that (0, 0) is the center of mass in the new coordinates. The Jacobean of this coordinate change is 1. 
The volume of the solid of revolution is V = (2tt) J J s r drdz = (2tt) j f R (u + b) dudv = 2irb J J b dudv = 2irbA, 
where we used that j j R u dudv = because (u,v) = (0,0) = (/ J R u dudv/ A, j j R v dudv/ A) is the center of 
mass of R. 



4) MONTE CARLO AGAIN. Lets assume 6 = 2 and a = 1. If (x, y, z) is a random point in [-3, 3] x [-3, 3] ) 
[—1, 1] then (r — 2) 2 + z 2 < 1 is the condition to be in the torus, where r 2 = x 2 + y 2 ■ 

Assume, we hit 5484 of 100000 the measured fraction of the volume 72 of the box we estimate 72*5484/100000 = 
3.94848 for the actual volume 4vr 2 = 39.4784. 



5) CAS. Integrate [r, {r, 1, 3}, {theta, 0, 2Pi}, {z, -Sqrt[l - (r - 2) 2 ], Sqrt[l - (r - 2) 2 ]}] 
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VECTOR FIELDS. 



* « < << t y ► * 



A "I A 
[ 1 

A vector field in the plane is a map, which assigns to each point (x, y) A. 4 4 



Planar vector field. ' ^ 



► V 



in the plane a vector F(x, y) = (M(x, y), N(x, y)). 

* * I V ^ T 
Vector field in space. ^ V > V y 

A vector field in space is a map, which assigns to each point (x, y, z) in •< A ► 

space a vector F(x, y, z) = (Mix, y, z), N(x, y, z), P(x, y, z)). 



PLANAR VECTOR FIELD EXAMPLES. > * A * < 



A ▼ Y ^ 



1) F(x,y) = (y, —x) is a planar vector field which you see in a picture 
on the right. 

2) F(x, y) = (x - 1, y)/((a: - l) 2 + y 2 ) 3 / 2 - (x + 1, y)/((a; + l) 2 + y 2 ) 3 / 2 
is the electric field of positive and negative point charge. It is called 



► * * 1 1 ► t * 



•1 



4 < * 

, , 1 < M 

dipole field. It is shown in the picture above. ^ . w w * * * 



GRADIENT FIELD. 2D: If f(x,y) is a function of two variables, then F(x,y) = Vf(x,y) is called a gradient 
field. The same in 3D: gradient fields are of the form F(x, y, z) = Vf(x, y, z). 

EXAMPLE. (2x, 2y, —2z) is the vector field which is orthogonal to hyperboloids x 2 + y 2 — z 2 = const. 

EXAMPLE of a VECTOR FIELD. If H(x, y) is a function of two variables, then (H y (x, y), -H x (x, y)) is called 
a Hamiltonian vector field. An example is the harmonic Oscillator H(x,y) = x 2 + y 2 . Its vector field 
(H y (x,y), —H x (x,y)) = (y, —x) is the same as in example l) above. 



WHEN IS A VECTOR FIELD A GRADIENT FIELD (2D)? 



F(x,y) = (M(x,y), N(x,y)) = Vf(x,y) implies N x (x,y) = M y (x,y). If this does not hold at some point, F is 
no gradient field. We will see next week that the condition curl(.F) = N x — M y = is also necessary for F to 
be a gradient field. 



EXAMPLE. VECTOR FIELDS IN BIOLOGY. 

Let x(t) denote the population of a "prey species" like tuna fish and y(t) is the population size of a "predator" 
like sharks. We have x'{t) = ax(t) + bx{t)y{t) with positive a,b because both more predators and more prey 
species will lead to prey consumption. The rate of change of y(t) is —cy(t) + dxy, where c, d are positive. 
We have a negative sign in the first part because predators would die out without food. The second term is 
explained because both more predators as well as more prey leads to a growth of predators through reproduction. 

A concrete example is the Volterra-Lodka system 

x = OAx — OAxy 
y = -Q.ly + Q.2xy 

Volterra explained with such systems the oscillation of fish pop- 
ulations in the Mediterranean sea. At any specific point (x, y) = 
(x(t),y(t)), there is a curve r(t) = (x(t),y(t)) through that 
point for which the tangent r'{t) = (x'(t),y'(t) is the vector 
(0.4a; - OAxy, -OAy + 0.2xy). 




VECTOR FIELDS IN PHYSICS 



Newton's law mf" = F relates the acceleration r" of a body with 
the force F acting at the point. For example, if x(t) is the position 
of a mass point in [—1, 1] attached at two springs and the mass 
is m = 2, then the point experiences a force {—x + (—x)) = —2x 
so that mx" = 2x or x"{t) = —x(t). If we introduce y(t) = x'{t) 
of t, then x'(t) = y(t) and y'(t) = —x(t). Of course y is the 
velocity of the mass point, so a pair (x, y), thought of as an initial 
condition, describes the system so that nature knows what the 
future evolution of the system has to be given that data. 

We don't yet know yet the curve t ^ (x(t), y(t)), but we know the 
tangents (x'(t), y'(t)) = {y{t), —x(t)). In other words, we know a 
direction at each point. The equation (x' = y,y' = —x) is called a 
system of ordinary differential equations (ODE). More generally, 
the problem when studying ODE's is to find solutions x(t),y(t) 
of equations x'(t) = f(x(t),y(t)),y'(t) = g(x(t),y(t)). Here we 
look for curves x(t), y(t) so that at any given point (x, y), the tan- 
gent vector (x'(t), y'(t)) is (y, —x). You can check by differentia- 
tion that the circles (x(t),y(t)) = (r sin(t), r cos(t)) are solutions. 
They form a family of curves. Can you interpret these solutions 
physically? 



AAAA^^AAAA 




VECTOR FIELDS IN MECHANICS 

If x(t) is the angle of a pendulum, then the gravity acting on it produces a force 
F(x) = —gms'm(x), where m is the mass of the pendulum and where g is a constant. 
For example, if x = (pendulum at bottom) or x = tt (pendulum at the top), then 
the force is zero. 

The Newton equation " mass times acceleration = Force" gives 



x(t) = —gsin(x(t)) . 

The equation of motion for the pendulum x(t) = 
with y = x also as 



A 



-gsin(x(t)) can be written 



(It 



(x(t),y(t)) = (y(t),-gsm(x(t)). 




Each possible motion of the pendulum x(t) is described by a curve r(t) = 
(x(t),y(t)). Writing down explicit formulas for (x(t),y(t)) is in this case not 
possible with known functions like sin, cos, exp, log etc. However, one still can 
understand the curves: 

Curves on the top of the picture represent situations where the velocity y is large. They describe the pendulum 
spinning around fast in the clockwise direction. Curves starting near the point (0, 0), where the pendulum is at 
a stable rest, describe small oscillations of the pendulum. 



VECTOR FIELDS IN METEOROLOGY. On maps like 
http://www.hpc.ncep.noaa.gov/sfc/satsfc.gif one can see Isoterms, 
curves of constant temperature or pressure p(x, y) = c. These are level 
curves. The wind maps are vector fields. F(x, y) is the wind velocity 
at the point (x,y). The wind velocity F is not always normal to the 
isobares, the lines of equal pressure p. The scalar pressure field p and 
the velocity field F depend on time. The equations which describe the 
weather dynamics are called the Navier Stokes equations 

d/dtF + F ■ VF = vAF - Vp + /, divF = 

(we will see what is A, div later.) It is a partial differential equation 
like u x — u y = 0. Finding solutions is not trivial: 1 Million dollars are 
given to the person proving that the equations have smooth solutions in 
space. 
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Section 13.1: 16,26, Section 13.2: 14,16,22 



f a F(r(t))-f*(t)dt 



f a F(r(t))-r<(t)dt 



is called the 



is called the 



LINE INTEGRALS. 

2D: If F{x 1 y) is a vector field in the plane and 7 : t 1— > r(t) is a curve, then 
line integral of F along the curve 7. 

3D: If F{x 1 y 1 z) is a vector field in space and 7 : t 1— > r(£) is a curve, then 
line integral of F along the curve 7. 

NOTATION. The short-hand notation f F • <is is also used. In the literature, where curves are sometimes 
written as r(t) = (x(t),y(t), z{t)) or r(£), the notation J F • dr or J F • appears. For simplicity, we leave 
out below the arrows above the r{t) and F(r(t)) even so they are vectors. 

WRITTEN OUT. If F = (M, N) and r = (x(t),y(t)), we can write f a M(x(t), y(t))x'(t) + N(x(t), y(t))y'(t) dt. 

MORE NOTATION. One also can write f*M(x,y) dx + N(x,y) dy. Warning: this later notation is only 
possible for certain type of curves and rather old fashioned. Avoid it. 

EXAMPLE: Work. If F(x, y, z) is a force field, then the line integral J* F(r(t)) ■ r'(t) dt is called work. 

EXAMPLE: Electric potential. If E(x,y,z) is an electric field, then the line integral E(r(t)) ■ r'{t) dt is 
called electric potential. 

EXAMPLE: Gradient field. If F(x,y,z) = \7U(x,y, z) is a gradient field, then as we will see next hour 
J a F(r(t)) ■ r'(t) dt = U(r(b)) — U(r(a)). The gradient field has physical relevance. For example, if U{x 1 y, z) is 
the pressure distribution in the atmosphere, then VU(x, y, z) is the pressure gradient roughly the wind velocity 
field. 




EXAMPLE 1. Let 7 : t 1— ► r(t) = (cos(t), sin(t)) be a circle parametrized by 
t G [0, 2tt] and let F(x, y) = (—y, x). Calculate the line integral I — J F(r)-dr. 

ANSWER: We have / = F(r(t)) ■ r'(t) dt = J^{- cos(t), sin(t)) • 
(- cos(t), sin(t)) dt = cos 2 (£) + sin 2 (£) dt = 2ir 



EXAMPLE 2. Let r(t) be a curve given in polar coordinates as r(t) = cos(£),0(£) = t defined on [0, 7r]. Let F 
be the vector field F(x, y) = {—xy, 0). Calculate the line integral J F ■ dr. 

SOLUTION. In Cartesian coordinates, the curve is r(t) = (cos 2 (£), cos(t) sin(t)). The velocity vector is then 
r '(t) = (-2 sin(t) cos(t), - sin 2 (£) + cos 2 (£)) = (x(t),y(t)). The line integral is 

/ F{r{t)) -r'{t) dt= I (cos 4 (£) sin(t), 0) • (-2 sin(t) cos(t), - sin 2 (£) + cos 2 (£)) dt 
Jo Jo 

= -2 1 sin 2 (t) cos 4 (t) dt = -2(£/16 + sin(2£)/64 - sin(4t)/64 - sin(6£)/192)|£ = -tt/8 
Jo 

WORK. If F is a force field and r(t) a path of a body, then F(r(t)) is the force acting on the body. The 
component of that force in the velocity direction is G(t) = F(r(t)) ■ r'(t)/\r'(t)\. For some small time dt, the 
body will move a distance \r'(t)\dt. In physics, G(t)ds is the amount of work done when traveling this distance. 
Integrating up gives the total work or energy W = f G(t)\r'(t)\ dt = f = F(r(t)) ■ r'(t) dt. 



W = f F ds is the energy gained by a body traveling along the 
path 7 in a force field F. 



EXAMPLE. In ion rockets (used for example in "deep 
space" space craft), ionized xenon gas is passed by an elec- 
trically charged plate and accelerated to high velocities 
30km/s. This (kinetic) energy is built up as work in a 
force field F which is parallel to an electric field E. Let 
7 : r(t) = (£, 0,0), < t < L be the path a particle trav- 
els between the positively charged plate and the negatively 
charged plate. 

J E dr is the voltage difference between the two plates and 
J 7 F dr is the energy difference of the particle. Because 
F = cE, where e is the charge of the ion and the velocity r' 
of the ions is parallel to the field, we know that ELc is the 
voltage difference and FL is the energy difference which is 
mv 2 /2, where m is the mass and v the velocity of the ion, 
we could get the electric field strength E = mv 2 /(2Lc). 



ADDING AND SUBTRACTING CURVES. 

If 7i, 72 are curves, then 71+72 denotes the curve obtained 
by traveling first along 71, then along 72. One writes —7 
for the curve 7 traveled backwards and 71 —72 = 7i + (— 72)- 

EXAMPLES. If 71 (t) = (t,0) for t 6 [0,1], 72(f) = (1, (t - 
1)) for t e [1,2], 73 (t) = (1 - (t - 2),1) for t 6 [2,3], 
7 4 (t) = (0, 1 — (t— 3)) for t G [3, 4], then 7 = 71+72+73+74 
for t G [0, 4] is the path which goes around a the unit square. 
The path —7 travels around in the clockwise direction. 



CALCULATING WITH LINE-INTEGRALS 

• i i F-dr + i i G-dr = i i {F + G)-dr. 

• S 1 cF ' -dr = cf^F -dr. 



VOLUME-PRESSURE. 
Processes involving gases 
or liquids can be described 
in a Volume-pressure 
diagram. 




A periodic processes like a refrigerator defines a closed cycle 7 : 1 1— ► r(t) = (V(t),p(t)) in the V — p plane. The 
curve is parameterized by the time t. At a given time the gas has a specific volume V(t) and a specific pressure 
p(t). Consider the vector field F(V,p) = (p,0) and a closed curve 7 and the line integral J^F ds. Writing it 

out, we get / 2 >(t),0) • (V'(t),p' (t)) dt = J^p(t)V'(t) dt = J^pdV. 

If the volume of the gas changes under pressure p, then the work on the system is pdV. On the other hand, if 
the volume is kept constant, then for a gas, one does not do work on the system, when changing the pressure. 
Processes described by this approximation are called adiabatic. 




. J F-dr = -f F-dr. 



For example, if the volume is decreased under high pressure and increased under low pressure then we do the 
work J^p dV. Lets compute that if r(t) = (2 + cos(t), 2 + sin(t)) for t G [0, 2tt] and F(V,p) = (p, 0). 
r'(t) = (-sin(£),cos(£)), F(r(t)) = (2 + sin(t), 0). so that F(r(t)) ■ r'(t) = - sin(t)(2 + sin(t)) and 
J 27r F{r{t)) ■ r'{t) dt = - ^ sin 2 (t) dt = tt. 
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SCALAR LINE INTEGRALS. 

If f(r(t)) is a function defined on a curve 7 : t 1— > f(t), then 
integral of / along the curve 7. 



f a f(f(tW(t)\dt 



is called the scalar line 



NOTATION. The short-hand notation J f ds is also used. 

WRITTEN OUT. If f(x,y) is the function and f(t) = (x(t),y(t)), we can write 

la /(*(*), y{t))^W+¥W dt. 



In three dimensions, where r{t) = (x(t), y(t), z(t)), we can write J a f(x(t),y(t),z(t))^x'(t) 2 +y'(t) 2 + z'(t) 2 dt. 



EXAMPLE. Integrate f(x, y, z) = x 2 + y 2 + z 2 over the path r{t) = (cos(t), sin(t), t) from t = to t = tt. The 
answer is /^(l + t 2 )^ = ttv^ + tt 3 \/2/3. 

EXAMPLE: Let r(t) = {cos(t), sin(t), t 2 /2} be the path of a model plane. What is the average height of the 
plane? 



This is not a very clearly formulated question. We want to know 

,2tt ,2tt t 2 

J z(t)\r'(t)\dt = J -VT^dt 

if we want to know the average height of the path and 

f z{t) dt= [ t 2 /2 dt 
Jo Jo 



if we want to know the average height per time. 




EXAMPLE. One of the hits on the web in March 2004 was a photo report of a Russian girl "Elena" who 
rode with her Kawasaki motorcycle and a camera through the deserted Chernobyl area and left an impressive 
document on the web. The URL is http://www.angelfire.com/extreme4/kiddofspeed/ 



Assume Elena picks up radioactive radiation 
proportional to the radioactivity level f{x,y) 
and the amount of path covered with the bike, 
then the total radiation obtained during the 



time [0, T] is 



../() 



f(r(t))\r'(t)\dt. 



This is not realistic. If Elena stops, she would 
get no radiation increase. The correct integral 
would rather be 



f 

Jo 



f(r(t)) dt . 




REMARK. 



I Scalar line integrals should be thought as a generalization of the length integral. j 

Do not mix it up with the line integral defined by a vector field which we cover next and which is infinitely 
more important. 



The examples above show that dealing with scalar line integrals can be confusing. You are measuring quantities 
with it which are given "per distance" and not quantities "per time". Scalar integrals hardly appear in 
applications. (They do for example in tomography where the problem is to reconctruct f(x,y, z) from knowing 
all line integrals along all lines. But also there, it is possible and simpler to avoid them.) 

• the application of computing mass is very artificial because mass is a triple integral. All solid bodies, even 
wires have three dimension. 

• In general, for one dimensional situations, the density is constant so that the scalar line integral is actually 
a usual length integral. Nobody questions the importance of the length integral. 

• For "center of mass" or moment of inertia" computations, one better uses triple integrals. Arcs or wires 
have a nonzero radius and the "simplification" done by computing it with one dimensional integrals 
produces an error. For mass computations, the error is zero by the Pappus Centroid theorem. 
Introducing artificial scalar line integrals is not only confusing, it is also not precise. 

The topic has been introduced into calculus text books as a "bridge" to ease the transition from ID integrals 
to line integrals but experience shows that this " bridge" unnecessarily complicates things because it introduces 
a new concept. 



Just treat the topic as a "footnote" to the length integral. 
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Section 13.2, Problems 20,24,40 



REVIEW LINE INTEGRALS. 



J C F -dr = S h a F{r{t))-r'{t)dt 



is called the line integral of F along the curve C. If F is a force, then the line integral is work. 



A RIDDLE ABOUT WORK. 

1) If you accelerate a car from Om/s to 
10m/ s = 2.24miles/hour the kinetic energy 
needed is mass times 10 2 /2 = 50. 



to 



2) If you accelerate a car from 10m/ s 
20m/ s = 4.48miles/hour the kinetic energy has 
increased by 20 2 /2 - 10 2 /2 = 200 - 50 = 150. 

3) Now watch situation 2) from a moving coordinate 
system which moves with constant 10m/ s. In that 
system, the car accerates from to 10 meter per 
second. 

Because all physical laws are the same in different 
coordinate systems (going from one to the other is 
a Galilei transformation), the energy should be the 
same when accelerating from to 10 or from 10 to 20. 

This is in contradiction to the fact that accelerating 
the car from to 10 needs three times less energy 
than accelerating the car from 10 to 20. Why? 




The picture above shows the " Smart car" , 
an extremly energy friendly car seen often 
in Europe, where gas prizes are much higher 
than here (even now) 



EXAMPLE. Compute the line integral for the vector field F(x, y) = (x 2 , y 2 ) along the boundary of a square i 
the counter clockwise direction. 



We split up the path into four paths: 

71 : r(t) = (t,0) for t G [0,1], 72 : r(t) = (l,t) for t G [0,1], 
73 : r(t) = (1-M) for t G [0,1], 74 : r(t) = (0,1-*) for t G [0,1], 
then 7 = 71 + 72 + 73 + 74- 

The integral is then 

IF ■dr=j F -dr+ I F ■ dr + / F ■ dr + / F ■ dr 

J 7 J~fi ^72 >^73 "^74 



Plugging in F ■ dr = F(r(t)) ■ r'(t) gives 

/ (t 2 ,0)- (1,0) dt + f (l,t 2 )- (0,1) dt + f ((1 -t) 2 ,l)- (-1,0) dt + f (0,(l-£) 2 )-(0,-l)< 
Jo Jo Jo Jo 

and end up with J* 2t 2 - 2(1 - t) 2 dt = 0. 




AMPERES LAW. A wire along the z-axes carries a current 
/. What is the strength \B\ of the magnetic field B(x,y,z) = 
(—By, Bx, 0) in distance r from the z-axes if we know that Amere 
law 



L 



B ■ dr = [iqI 



holds for a closed curve C : f(t) = (r cos(t), r sin(t), 0) winding in 
distance r once around the x axes and where /xo is a constant. 
Remark. The Ampere law actually follows from a basic identity 
which is one of the Maxwell equations and which we will see later 
in this course. 




Solution: compute B(r(t)) = (— B sin(t), B cos(t), 0) and r'(t) = (— r sin(t), r cos(t)) The line integral is 2irrB 
and because we know this is (il, we have B = /j,I/2irr. 



T I T 



We see that the magnetic field decays like 1/r. One can actually 
derive this result also from special relativity: A striking applica- 
tion of the Lorentz transformation is that if you take two wires 
and let an electric current flow in the same direction, then the dis- 
tance between the electrons shrinks: the positively charged ions * 
in the wire see a larger electron density than the ion density. The 
other wires appears negatively "charged" for the ions and attract 
each other. If the currents flow in different directions and we go 
into a coordinate system, where the electrons are at rest in the 

first wire, then the ion density of the ions in the same wire appears - . - . - - - Z*jL "rf Zd, ?jf> w tf w tf *<S *«5 

denser and also the electron density is denser in the other wire. 
The two wires then repell each other. The force is proportional to 
1/r because two charged wires in distance r attract or repel each 
other with the force 1/r. 



1 1 1 



A PERPETUUM MOTION MACHINE. Before entering high school, one of my passions was to construct 
" perpetuum motion machines" . Physics classes in school quickly killed that dream. But still, these machines 
still fascinate me, especially if they work! Will will see more about these machines next week, but here is a 
model which allows us to practice line integrals: 



Problem: by arranging cleverly charged plates (see lecture), we realize a static electric field E = (0,0, x). 
construct a wire along the elliptical path C : r(t) = (cos(t), 0, 3 sin(t)). What is the Voltage 

V= [ E(r(t)) ■ r' (t) dt 



Wc 



we measure at the wire? 
Answer: J Q 27r 3cos 2 (£) dt = 3tt. 

30 second background info in electricity: when an electron is moved around in an electric field along 
a path C, it gains the potential J C E ■ dr. This potential is also called "voltage" and measured in "Volts". 
When moving a charge through a voltage differenence, it gains some energy. This energy is proportional to 
the amount of charge going through the wire as well as the voltage V. If the charge is It which is "current" 
times "time", then the energy is Vlt, which can also be seen as the power VI ("volt times ampere = Watts) 
multiplied with time. For example, through a light bulb of lOOVF, a current of about one ampere flows if the 
voltage is 110 Volts. If we let the lamp burn for 10 hours, we use the energy 10 • lOOWh which is one Watt 
hour. In Massachusetts, a Kilowatt hour costs about 10 — 15 cents. 



Running a 100 Watt light bulb for 24 hours costs a quarter. 
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HOMEWORK: Section 13.3: 10,12,38 



REVIEW. If F is a vector field and C : t f(t) is a curve, then 
of F along the curve C. 



J a F(f(t)) ■ f"(t) dt is called the line integral 



GRADIENT FIELD. A vector field F is called a gradient field if there exists a function / such that F — V/. 
For example, F(x,y) = (xy 2 ,yx 2 ) is a gradient field. 



FUNDAMENTAL THEOREM OF LINE INTEGRALS. If F = V/, then 



f b a F(r(t))-r'(t)dt = f(r(b))-f(r(a)) 



In other words, the line integral integral is the potential difference between the end points r(b) and r(a) if F is 
a gradient field. 

EXAMPLE. Let f{x, y, z) be the temperature distribution in a room and let r{t) the path of a fly in the room, 
then f{r(t)) is the temperature, the fly experiences at the point r(t) at time t. The change of temperature for 
the fly is ^f(r{t)). The line-integral of the temperature gradient V/ along the path of the fly coincides with 
the temperature difference between the end and initial point. 

SPECIAL CASES. 

r{t) parallel to level curve of / means d/dtf{r{i)) = and r'(t) orthogonal to V/(r(t)) r(t) orthogonal to level 
curve means \d/dtf(r(t))\ = |V/||r'(£)| and r'{t) parallel to V/(r(t)). 

PROOF OF THE FUNDAMENTAL THEOREM. Use the chain rule in the second equality and the fundamental 
theorem of calculus in the third equality of the following identities: 

J" F(r(t)) ■ r'{t) dt = J" V/(r(t)) • r'(t) dt = £ dt = f(r(b)) - f(r(a)) . 



CONSERVATIVE. A field F is called conservative if every line integral is independent of paths. 



| F is a gradient field if and only if it is conservative. | 

Proof. If F = V/, the conservative property follows from the fundamental theorem of line integrals. To see the 
other direction, chose a point y and for every x a path C x connecting y with x and define f(x) = J c F ■ dr. 
The conservative property assures that the result is independent of the chosen path. Note that / is not unique: 
changing y will add a constant to /. 

CLOSED LOOP PROPERTY = ENERGY CONSERVATION. It follows that for a gradient field the lineintegral 
along any closed curve is zero. Conversly, if this "energy conservation" holds, one has a conservative field. 



If F is a gradient field if and only if the line integral along a closed 
curve is zero if and only if the field is a gradient field. 



PERPETUUM MOTION MACHINES. A machine 
which implements a force field which is not a gradi- 
ent field is called a perpetuum mobile. Mathemat- 
ically, it realizes a force field for which along some 
closed loops the energy gain is nonnegative. (By pos- 
sibly changing the direction, the energy change can 
be made positive). The first law of thermodynamics 
forbids the existence of such a machine. 
It is informative to stare at some of the ideas people 
have come up with and to see why they don't work. 
The drawings of Escher appear also to produce situ- 
ations, where a force field can be used to gain energy. 
Escher uses genius graphical tricks however. 





THE COMPONENT TEST. 

When is a vector field a gradient field? F(x,y) = Vf(x,y) implies F y (x,y) = F x (x,y). If this does not 
hold at some point, F = (M,N) is no gradient field. This is called the component test. The condition 
curl(F) = N x — My = implies that the field is conservative if the region satisfies a certain property. 



FIND THE POTENTIAL. 

PROBLEM 1. Let F(x, y) = {2xy 2 + 3x 2 , 2yx 2 ). Find a potential / of F. 

SOLUTION. The potential function f(x,y) satisfies f x (x,y) = 2xy 2 + 3x 2 and f y (x,y) = 2yx 2 . Integrating the 
second equation gives f(x, y) = x 2 y 2 -\-h{x). Partial differentiation with respect to x gives f x (x, y) = 2xy 2 -\-h'(x) 
which should be 2xy 2 + 3a; 2 so that we can take h(x) = x 3 . The potential function is f(x, y) = x 2 y 2 + x 3 . 

Find g, h from f(x, y) = J* M(x, y) dx + h(y) and f y (x, y) = g(x, y). 

PROBLEM 2. Find values for the constants a,b which make the vector field F = (M, N) = {ax 3 y + by 2 , x 4 + yx) 
conservative. 

SOLUTION. The curl N x - M y = Ax 3 + y - ax 3 must be zero. This gives a = 4 and 6=1/2. The potential is 
f(x,y) = (x* + y 2 x/2). 



A COUNTER EXAMPLE? Let F(x, y) = (M, N) = 
IF+y*)- lt is a gradient field because f(x, y) = arctan(y/x) 
has the property that f x = (-y/x 2 )/(l + y 2 /x 2 ) = M,f y = 
(1 / x) I (1 + y 2 1 x 2 ) = N. However, the line integral J^F ds, where 
7 is the unit circle is 



Jo 



- sin(t) 



cos(t) 



Jo cos 2 (t) + sin (t) cos 2 (t) + sin 
which is J Q 27r 1 dt = 2tt. What is wrong? 



• (— sin(t), cos(t) dt 



Solution: note that the potential / as well as the vectorfield F are 
not smooth everywhere. 




SIMPLY CONNECTED. A region R is called simply connected, if every curve in R can be contracted to a 
point in a continuous way and every two points can be connected by a path. A disc is an example of a simply 
connected region, an annulus is an example which is not. 



"CONSERVATIVE <^ curl(F) = 0. 



If R is a simply connected region, then F is a gradient field if and 
only if curl(F) = everywhere in R. 



We will prove this on Friday. 





Gradient field 


SUMMARY. 




We have three equivalent prop- 
erties: conservative=gradient 
field= closed loop property. 
They imply the component 




~ Closed loop 
Conservative , r 
property 


test curl(F) = 0. If the region 
is simply connected, all four 
properties are equivalent. 


Passes 
component test 
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MAIN 

MIDTERM 

TOPICS. 



Extrema of functions of two variables. 
Extrema of functions with constraints. 
Error of linear approximation 
Integration in two dimensions 
Integration in three dimensions 
Integration in cylindrical coordinates 



Linear approximation. 
Estimation using linear approximation 
Tangent lines and planes 
Integration in polar coordinates 
Integration in spherical coordinates 
Application of integrals 



LEVELS OF UNDERSTANDING. 



I) KNOW Know what the objects, definitions, theorems, names are. Know jargon and history, 

II) DO Know how to work with the objects. Pursue algorithms. 

III) UNDERSTAND See different aspects, contexts. Why is it done like this? 

IV) APPLY Extend the theory, apply to new situations, invent new objects. Ask: Why not...? 



I) Definitions and objects. 

CONSTRAINED EXTREMUM of / constrained by g = c are obtained where V/ = Wg,g = c of W(g) = 0. 
CRITICAL POINT. Wf(x,y) = (0,0). Is also called stationary point. 
DOUBLE INTEGRAL. £ fjj*^ f(x, y) dydx is an example of a double integral. 
2D POLAR INTEGRAL. / f R f(r,<f>) rdrd<j> in polar coordinates. 
DISCRIMINANT. D = f xx (x,y)f yy (x,y) - fxy 2 (x,y). 

LINEAR APPROXIMATION. L(x, y) = f(x ,y ) + V/(rc , Vo) • (x - x , y - y ). 

ERROR ESTIMATE. \L(x, y) - f(x,y)\ < M/2\(x - x ) 2 + (y - y ) 2 \ where \ f xx \ < M, \f yy \ < M, \f xy \ < M 
in rectangle having (xo,yo) and (x,y) opposite corners. 

LOCAL MAXIMUM. A critical point for which det(H(x,y)) > 0,H xx (x,y) < is a local maximum. 
LOCAL MINIMUM. A critical point for which det(H(x,y)) > : H xx (x,y) > is a local minimum. 
SADDLE POINT. A critical point for which det(H(x,y)) < 0. 

SECOND DERIVATIVE TEST. D < saddle, D > 0, f xx > min, D > 0, f yy < max. 

ON CLOSED DOMAIN. There is a maximum and a minimum. If it is not inside, it is on the boundary. 



II) Algorithms: INTEGRATION OVER A DOMAIN R. 



1) Eventually chop the region into pieces which can be parametrized. 

2) Start with one variable, say x and find the smallest ^-interval [a, b] which contains R. 

3) For fixed x, intersect the line x = const with R to determine the ^/-bounds [f(x),g(x)]. 

4) Evaluate the integral dydx. 

5) Solve the double integral by ID integration starting from inside. 

6) In case of problems with the integral, try to switch the order of integration. (Go to 2) and start with y). 

EXAMPLE. Integrate x 2 y 2 over the triangle x + y/2 < 3,x > 0,y > 1. The triangle is contained in the strip 
< x < 3. The ^-integration ranges over the interval [0, 3]. For fixed x, we have y > 1 and y < 2(3 — x) which 
means that the y bounds are [0, 2(3 - x)}. The double integral is ^ f* x 2 y 2 dydx. 



II) Algorithms: FINDING EXTREMA ON A DOMAIN R with boundary. 

1) First look for all stationary points V f(x, y) in the interior of R. 

2) Eventually classify the points in the interior by looking at D, f xx (x : y) at the critical points. 

3) Locate the critical points at the boundary by solving VF(x, y) = WG(x, y), G(x, y) = c. 

4) List the values of F evaluated at all the points found in 1) and 3) and compare them. 

EXAMPLE. Find the maximum of F(x, y) = x 2 — y 2 — x 4 — y 4 on the domain x 4 + y 4 < 1. 
X7F(x,y) = (2x — 4x 3 , —2y — 4y 3 ). The critical points inside the domain are obtained by solving 2x — 4x 3 = 
0,—2y - Ay 3 = which means x = 0,x = 1/4, y = 0,y = -1/4. We have four points Pi = (0,0), P 2 = 
(1/V2,0),P 3 = (0,-l/v / 2),P 4 = (l/v / 2,-l/v / 2). 

The critical points on the boundary are obtained by solving the Lagrange equations (2x — 4x 3 , —2y — Ay 3 ) = 
\{4x 3 ,4y 3 ),x 4 + y 4 = 1. Solutions (see below) are P 5 = (0,-1), P 6 = (0,1), P 7 = (-1,0), P 8 = (1,0). A list 
of function values F{P X ) = 0,P(P 2 ) = 1/2 - 1/4, P(P 3 ) = -1/2 - 1/4, P(P 4 ) = -2/4, P(P 5 ) = -2,P(P 6 ) = 
—2,F(Pi) = 0,P(Ps) = shows that P2 in the interior is the maximum. Indeed, the Hessian at this point is 
H = diag(— 1, —2) which has positive determinant and negative H\\. 



II) Algorithms: SOLVE LAGRANGE PROBLEMS. 

1) Write down equations neatly. 

2) See whether some variable can be eliminated easily. 

3) If some variable can be eliminated easily, go back to 1) using one variable less and repeat. 

4) Try to combine, rearrange, simplify the equations. The system might not have an algebraic solution. 



2x - Ax 3 


= A\x 3 


2a:- 


(4 


+ 4A)£ 3 


= 


x = or 


x = 1/V2 + 2A 


-2y - Ay 3 


= 4A^ 3 


-2y- 


(4 


+ 4A)^ 3 


= 


y = or 


? / = -l/v / 2 + 2A 


x 4 + y 4 


= 1 






x 4 + y 4 


= 1 


x 4 + y 4 = 


1 



If x=0, then y=l, or y=-l, if y = then x = 1 or x = — 1. There are 4 critical points. 

III) Understanding: Try to answer questions like: 

What can happen at a critical point if the discriminant What does it mean if the Lagrange multiplier is zero? 
is zero at this point? Is there / with two saddles as critical points and no 

What is the geometric meaning of f xx 7 other critical points? 

IV) Apply to new situations. 

Problem solving and creativity skills are acquired best by "doing" it, by pondering over new questions, working 
on specific problems. Nevertheless, there is also theoretical help: for example, Polyas's book "how to solve it" 
gives some general advise on "how to solve problems". Abstract: "How to solve a problem?": 



1) Understand the problem. 

2) Think of a plan by solving subproblems. 
Connect with older problems. 



3) Walk along the plan while controlling each step. 

4) Check the result. Is the result obvious? Is the 
method useful for other problems? 



PRO MEMORIAM. Solve integration problems with the help of good figures! 



polar coordinates 


cylindrical coordinates 


spherical coordinates 


ff R g(r,0)\r)iBdr 

x = rcos(6>) 
y = rsin(0) 


JJJ R g(r,d,z)\T]drd0dz 
x = rcos(6>) 
y = rsin(0) 

z = z 


JJJ R g(p,e,z) p 2 sin(0) dpdOdcj) 


x = psin(0) cos(0) 
y = psin(0) sin(0) 
z = pcos(0) 



One uses these formulas to express the function f(x, y, z) in the new coordinates as a new function g like in 



f(x,y) = {x 2 + y 2 ) 2 + xy 
f(x,y,z) = x 2 + y 2 + z 2 - 



g (r, 6) = r 4 + r 2 cos(#) sin(0) 
g(p, 6, 4>) = P 2 - P 3 cos 3 (fl) sin 3 (0) 



Application 


2D Integral 


Area 

Mass 

Average 

Centroid 

Center of mass 

Moment of inertia 

Radius of gyration 


A = jj R 1 dA 

M = ff R 5(x,y) dA 

ff R f(x,y) dA/A. 

(JJ R xdA/AjJ R ydA/A). 

(JJ R x5{ Xl y)dA/M, JJ R y5{ Xl y)dA/M). 

I = JJ R (x 2 + y 2 ) dA (with respect to origin) 

VTJm 



Application 


3D Integral 


Volume 

Mass 

Average 

Centroid 

Center of mass 

Moment of inertia 

Radius of gyration 


v = SS R ldV 

M = JJJ R 6(x,y,z) dV 

JJJ R f(x,y,z) dV/V. 

(JJJ R x dV/V, JJJ R y dV/V, JJJ R z dV/V). 

(JJJ R x5 dV/M, JJJ R x5 dV/M, JJJ R y5 dV/M). 

I = JJJ R (x 2 + y 2 ) dV (with respect to z-axes.) 

VTJm 
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HOMEWORK: section 13.4: 4,22,24,26 



LINE INTEGRALS (RECALL). If F(x, y) = (M(x,y), N(x,y)) is a vector field and C : r(t) = (x(t),y(t)),t G 
[a, 6] is a curve, then 



J^F ds = f a F(x(t),y(t)) ■ (x>(t),y>(t)) dt 



is called the line integral of F along 7. Its helpful to think of the integral as the work of a force field F along 
C. It is positive if "the force is with you", negative, if you have to "fight against the force", while going along 
the path C. 



THE CURL OF A 2D VECTOR FIELD. The curl of a 2D vector field F(x,y) = {M{x,y),N{x,y)) is defined 
as the scalar field 

cw\(F)(x,y) = N x (x,y) - M y (x,y) . 

INTERPRETATION. curl(F) measures the vorticity of the vector field. One can write V x F = curl(F) for 
the curl of F because the 2D cross product of (d x ,d y ) with F = (M, N) is N x — M y . 

EXAMPLES. 

1) F{x,y) = (-y,x). cuil(F)(x,y) = 2. 

2) F(x, y) = V/, (conservative field = gradient field = potential) Because M(x, y) = f x (x, y),N(x, y) = f y (x, y), 
we have curl(F) = N x - M y = f yx - f xy = 0. 



GREEN'S THEOREM. (1827) If F(x,y) = [M{x,y) 1 N{x 1 y)) is a vector field and R is a region which has as 
a boundary a piecewise smooth closed curve C traversed in the direction so that the region R is "to the left". 
Then 



J c F ■ ds = J j R curl(F) dxdy 




Note: for a region with holes, the boundary consists of many curves. They are always oriented so that R is to 
the left. 



GEORGE GREEN (1793-1841) was one of the most remarkable physi- 
cists of the nineteenth century. He was a self-taught mathematician and 
miller, whose work has contributed greatly to modern physics. Unfortu- 
nately, we don't have a picture of George Green. 



♦ 



SPECIAL CASE. If F is a gradient field, then both sides of Green's theorem are zero: 



J c F ■ ds is zero by the fundamental theorem for 
line integrals. 



/ J R curl(F) • dA is zero because 
curl(F) = curl(grad(/)) = 0. 



The fact that curl(grad(/)) = can be checked directly but it can also be seen from V x V/ and the fact that 
the cross product of two identical vectors is 0. One just has to treat V as a vector. 

APPLICATION: CALCULATING LINE INTEGRALS. Sometimes, the calculation of line integrals is harder 
than calculating a double integral. Example: calculate the line integral of F(x,y) = (x 2 — y 2 1 2xy) = (M,N) 
along the boundary of the rectangle [0,2] x [0,1]. Solution: curl(F) = N x — M y = 2y — 2y = —4y so that 
J c F dr = Si So Ay dydx = 2y 2 \lx\ 2 = 4. 



Remark. One can easily find examples, where the calculation of the line integral is not possible in closed form 
directly, but where Green allows to do it nevertheless. 



WHERE IS THE PROOF? (Quote: General Hein in "Final Fantasy"). 

To prove Green's theorem, look first at a small square R= [x,x + e]x[y,y + e]. The line integral of F = (M, N) 
along the boundary is / Q e M(x + t, y)dt + Sq N(x + e,y + t) dt- So M(x + t,y + e) dt- Sq N(x, y + t) dt. (Note 
also that this line integral measures the "circulation" at the place (x,y).) 

Because N(x + e, y) — N(x, y) ~ N x (x, y)e and M(x, y + e) — M(x, y) ~ 
M y (x,y)e, the line integral is (M x — N x )e 2 is about the same as 
S Q e Sq curl(F) dxdy. All identities hold in the limit e — > 0. 
To prove the statement for a general region R, we chop it into small 
squares of size e. Summing up all the line integrals around the bound- 
aries gives the line integral around the boundary because in the interior, 
the line integrals cancel. Summing up the vorticities on the rectangles 
is a Riemann sum approximation of the double integral. 



APPLICATION: CALCULATING DOUBLE INTEGRALS. 

Sometimes, the reverse is true and it is harder to calculate the double integral. An example is to determine the 
area of a polygon with sides (xi,yi), (x n ,y n ) (see problem 21, section 13.4 in the book). In that case, there 
is a closed formula for the area: A=\ Y^i=i( x iVi+i ~ x i+iVi)i something a computer can do evaluate very fast 
and which does not involve any integration. This formula is used for example in computer graphics. 



APPLICATION: FINDING THE CENTROID OF A REGION. 
See homework. Green's theorem allows to express the coordinates of the centroid 

(J J R XdA/A,J J R ydA/A) 
as line integrals. One just has to find the right vector fields for each coordinate. For example, to verify 

[ [ xdA = [ Fdr 

J JR JC 

take the vector field F dr = x 2 dy. 



APPLICATION: AREA FORMULAS. 




The vector fields F(x,y) = (M,N) = (-y,0) or F(x,y) = (0,x) have vorticity cvnl(F(x,y)) = I. The right 
hand side in Green's theorem is the area of R: 

Area(R) = f c —y dx = S c x dy 



EXAMPLE. Let R be the region under the graph of a function f(x) on [a, 6]. The lineintegral around the 
boundary of R is from (a, 0) to (6, 0) because F(x, y) = there. The lineintegral is also zero from (6, 0) to 
(6,/(6)) and {aj{a)) to (a,0) because = 0. The line integral on (t,f(t)) is - /^((-^(t), 0) • (l,f'(t))dt = 
J b fit) dt. Green's theorem assures that this is the area of the region below the graph. 

APPLICATION. THE PLANIMETER. 

The planimeter is a mechanical device for measuring areas: in medicine jfcv '^^^^^^BMHP^^H 

to measure the size of the cross-sections of tumors, in biology to measure i KpH^^^^^B^'f * s iHr] 

the area of leaves or wing sizes of insects, in agriculture to measure the HilH ^KKlfiiS^I 

area of forests, in engeneering to measure the size of profiles. There is HEUl 

a vector field F associated to a planimeter (put a vector of length 1 HQI 

orthogonally to the arm). One can prove that F has vorticity 1. The w^^Hft^9^?!II^^^^H^ 

planimeter calculates the line integral of F along a given curve. Green's 'i^'I^^H ^Bfe!^^^^Bj 

theorem assures it is the area. rVl^^HH^^^H^^^^Kl 

The picture to the right shows a Java applet which allows to explore the ^■^^^^^^^^^^^■^H^B 

planimeter (from a CCP module by O. Knill and D. Winter, 2001). 

To explore the planimeter, visit the URL http://www.math.duke.edu/education/ccp/materials/mvcalc/green/ 
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GREEN'S THEOREM is extremely useful in physics. 

It belongs to the most advanced topics in calculus. If you master it, you have so to speek the black belt in 
calculus. Of course, it needs some time to learn. 



THERMODYNAMICS. When studing gases or liquids, one draws often a P — V diagram (evenso one draws 
V usually as the x-axis). The volume in the x-axis and the pressure in the y axis. 

For periodic (running of a car, the pump in a refridgeratore), one gets a closed curve 7 : t ^ r(t) = (V(t),p(t)) 
in the V — p plane. The curve is parameterized by the time t. At a given time, the gas has a specific volume 
V(t) and a specific pressure p(t). 

Consider the vector field F(V,p) = (p, 0) and a closed curve 7. What is J F dsl Writing it out, we get 
/ T O(£),0) • (V'(t),p'(t) dt = J T p{t)V'(t)dt = JoPdV. The curl of F(V,p) is -1. You see that the integral 
— Jq pdV is the area of the region enclosed by the curve. 

Where is the physics? If the volume of a gas changes by constant pressure, then the work on the system is 
pdV. On the other hand, if the volume is held constant, then for a gas, one does no work on the system, when 
changing the pressure. 

EXAMPLE: ELECTRO ENGINE. Let us look at a cyclic process, where the volume is decreased under low 
pressure and increased under high pressure. It is clear that the gas does some work in this case. How much is 
it? Well, it is Jo pdV for a curve which goes clockwise around a closed region R. Green's theorem tells us that 
the work done is the area of the region R. 



ELECTROMAGNETISM. In the plane (flatland), the electric field is a vector field E = {E ll E 2 ) 1 while the 
magnetic field is a scalar field. One of the 2D Maxwell equation is curl(i?) = —^-^B, where c is the speed of light. 

Consider a region R bounded by a wire 7. Greens theorem tells us that d/dt J J R B(t)dxdy is the lineintegral 
of E around the boundary. But f Eds is a voltage. A change of the magnetic field produces a voltage. This is 
the dynamo. 

This statement will become much more clear and useful when we lift Green to three dimensions (which will be 
called the theorem of Stokes). 



FLUID DYNAMICS. If v is the velocity distribution of a fluid in the plane, then u{x,y) = c\xi\{v){x 1 y) is 
called the vorticity of the fluid. 

If v = VU (potential flows), then the fluid is called irrotational. As you can check, then oj = curl(v) = by 
Green's theorem. 



The integral j j R codxdy is called the vortex flux through R. Green's theorem tells that this flux is related to 
the circulation on the boundary. 

In water, you could ideally measure the amount of vorticity inside a region R by measureing the work, you have 
to do by swimming around the boundary of R. 



A RELATED THEOREM. If we rotate the vector field F = (P, Q) by tt/2 we get a new vector field G = (-Q, P) 
and the integral J^F • ds becomes a flux of G through the boundary of R. Introducing div(F) = (P x + Q y ) 
we see that curl(F) = div(G). Greem's theorem now becomes 

J J div(G) dxdy = J G ■ dn , 

where dn(x,y) is a normal vector at (x,y) orthogonal to the velocity vecgtor r'(x,y) at a point (x,y). 

This new theorem has a generalisation to three dimensions too. It is called Gauss theorem or divergence 

theorem. 
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HOMEWORK. Section 13.5: 2,4,10, Section 13.6: 20 



SURFACE AREA 



is the area of the surface. 




INTEGRAL OF A SCALAR FUNCTION ON A SURFACE. If 
S is a surface, then j j s f(x,y) dS/Area should be an average 
of / on the surface. If f(x,y) = 1, then A = J J s dS should 
be the area of the surface. If S is the image of r under the map 

(u,v) i ► f(u,v), then dS = \r u x r v \ dudv. 




Given a surface S = f(R), where R is a domain in the plane and where f(u, v) = (x(u, v), y(u, v), z(u, v)). The 
surface integral of f(u, v) on S is defined as J J s f dS = J J R f(u, v)\r u x r v \ dudv. 

INTERPRETATION. If f(x, y) measures a quantity then J J s f dS/ J f s ldS is the average of the function / 
on S. Analoguously to the scalar line ingegral f(r(t)\r' (t)\ dt, the scalar surface integral should be considered 
a footnote to the surface area integral only. 

EXPLANATION OF \ f u x f v \. The vector r u is a tangent vector to the curve u i— > f(u,v), when v is fixed and 
the vector f v is a tangent vector to the curve v i— > r(u : v) : when u is fixed. The two vectors span a parallelogram 
with area \ f u x f v \. A little rectangle spanned by [u, u + du] and [v, v + dv] is mapped by f to a parallelogram 
spanned by [r, r + r u ] and [r, r + r v ]. 

A simple case: consider f(u,v) = (2u,3v,0). This surface is part of the xy-plane. The parameter region R just 
gets stretched by a factor 2 in the x coordinate and by a factor 3 in the y coordinate, r. u x f v = (0, 0, 6) and we 
see for example that the area of f(R) is 6 times the area of R. 



POLAR COORDINATES. If we take r(u,v) = (ucos(v),wsin(v),0), then the rectangle [0,R] x [0,2tt] is 
mapped into a flat surface which is a disc in the xy-p\&ne. In this case r u xr v = (cos(i>), sin(v), 0) x 
(— u sin(i>), u cos(u), 0) = (0,0, u) and \r u x r v \ = u = r. We can explain the integration factor r in polar 
coordinates as a special case of a surface integral. 



THE AREA OF THE SPHERE. 



The map r : (u,v) ^ (L cos(w) sin(v), L sin(u) sin(i>), L cos(v)) maps the rectangle R : [0,2ir] x [0,7r] onto 
the sphere of radius L. We compute f u xf v = L sm(v)f(u, v). So, \f u x f v \ = L 2 \sin(v)\ and J J R 1 dS ■ 
Jq 7 " Jq L 2 sin(v) dvdu = AttL 2 . 



SURFACE AREA OF GRAPHS. For surfaces (u,v) ^ 
(u, v, f(u, v)), we have f u = (1, 0, f u (u, v)) and f v = 
(0, 1, f v (u, v)). The cross product r u x r v = (—f u ,—f v ,l) 
has the length y^TTf 2 ! 7 ! 2 . The area of the surface above a 

region R is 



EXAMPLE. The surface area of the paraboloid z = f(x, y) = x 2 + 
y 2 is (use polar coordinates) \/T + 4r 2 r drdO = 27r(2/3)(l + 
4r 2 ) 3 / 2 /8|J = tt(5 3 / 2 - l)/6. 




AREA OF SURFACES OF REVOLUTION. If we rotate 
the graph of a function f(x) on an interval [a, 6] around 
the x-axis, we get a surface parameterized by (u, v) ^ 
f(u,v) = (v, f(v) cos(w), f(v) sin(w)) on R = [0,2tt] x [a,b] 
and is called a surface of revolution. We have f u = 
(0, — f(v) sin(w), f(v) cos(w)),r\, = (1, f'(v) cos(w), f'(v) sin(w)) 
and r u x r v = (— f(v)f'(v), f(v) cos(w), f(v) sin(ii)) 
= f(y)(— f'(v), cos(w), sin(ii)). The surface area is 

J J \ f u x f v \ dudv = 2ir \f(v)\y/l + f'{v) 2 dv. 

EXAMPLE. If f(x) = x on [0, 1], we get the surface area of a cone: / Jo xVTTT dvdu = 2^x^/2 = irV2. 

P.S. In computer graphics, surfaces of revolutions are constructed from a few prescribed points (xi, f{xi)). The 
machine constructs a function (spline) and rotates 



GABRIEL'S TRUMPET. Take f(x) = l/xon the interval [1, oo). 
Volume: The volume is (use cylindrical coordinates in the x di- 
rection): 7rf(x) 2 dx = 7r Jj 00 l/x 2 dx = IT. 
Area: The area is Jo JT 1/xy/l + l/x 4 dx > 2ttJ™ l/x dx = 
27rlog(x)| x> = oo. 



The Gabriel trumpet is a surface of finite volume but with infinite surface area! You can fill the trumpet with 
a finite amount of paint, but this paint does not suffice to cover the surface of the trumpet! 

Question. How long does a Gabriel trumpet have to be so that its surface is 500cm 2 (area of sheet of paper)? 
Because 1 < y/l + l/x 4 < \/2, the area for a trumpet of length L is between 2tt l/x dx = 27rlog(L) and 
V / 227rlog(L). In our case, L is between e 500/(V227r) _ 2 + 1Q 24 cm and e 500/(27r) _ 4 + 10 34 cm _ Note that the 
universe is about 10 26 cm long (assuming that the universe expanded with speed of light since 15 Billion year). 
It could not accommodate a Gabriel trumpet with the surface area of a sheet of paper. 



MOBIUS STRIP. The surface f(u, v) = (2 + v cos(w/2) cos(w), (2 + 
v cos(u/2)) sin(w), v sin(w/2)) parametrized by R = [0, 2ir] x [—1, 1] 
is called a Mobius strip. 

The calculation of | f u xf v | = 4+3v 2 /4+4vcos(w/2)+v 2 cos(w)/2 is 
straightforward but a bit tedious. The integral over [0, 2ir] x [— 1, 1] 
is 17 it. 



QUESTION. If we build the Moebius strip from paper. What is 
the relation between the area of the surface and the weight of the 
surface? 

REMARKS. 

1) An OpenGL implementation of an Escher theme can be ad- 
mired with "xlock -inwindow -mode moebius" in XI 1. 

2) A patent was once assigned to the idea to use a Moebius strip 
as a conveyor belt. It would last twice as long as an ordinary 
one. 



SURFACE AREA OF IMPLICIT SURFACES. If f(x,y,z) = c is an implicit surface which is a graph over 
a region R in the xy-plane (add the graph assumption to the box on page 1095 in the Thomas), then the 
surface area is J J R |V/|/|/ Z | dA if f z ^ on R. This follows from the formula in the case of the graph and the 

fact that v ffi |^J — - = \f{f x /fz) 2 + {fy/ fz) 2 + 1 = Jg 2 + g 2 + 1, if z = g(x, y) by implicit differentation. 
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FLUX INTEGRAL OF A VECTOR FIELD THROUGH SUR- 
FACE. If S is a surface, and F is a vector field in space we define 
an integral J J s F ■ dS which is called the flux of F through the 
surface S. 




DEFINITION. Given a surface S = r(R), where R is a domain 
in the plane and where r(u,v) = (x(u, v), y(u, v), z(u, v)) is the 
parameterization of the surface. The flux integral of F through 
S is defined as the double integral 



JJ s F-dS = JJ R F(r(u, v)) ■ (r u x r v ) dudv 



The notation dS = (r u x r v ) dudv means an infinitesimal vector 
in the normal direction at the surface. 




INTERPRETATION. If F = fluid velocity field, then / J s F ■ dS is the flux of fluid passing through S. 



EXAMPLE. Let F(x, y, z) = (0, 1, z 2 ) and let S be the sphere with 

r(u,v) = (cos(w) sin(i>), sin(w) sin(v), cos(v)) 
, r u x r v = sin(v)r so that F(r(u, v)) = (0, 1, cos 2 (v)) and 

/ / sin(v)(0, 1, cos 2 (v)) ■ (cos(w) sin(v), sin(w) sin(v), cos(v)) dudv . 
Jo Jo 

The integral is J 27T s'm 2 (v) sin(w) + cos 3 (v) s'm(v) dudv = 0. 

Look at the vector field. Most flux passes in at the south pole, most flux passes out at the north pole. However, 
there is some symmetry and what enters in the south leaves in the north. This explains why the total flux is 
zero. 

WHAT IS THE FLUX INTEGRAL? Because n = r u x r v /\r u x r v \ is a unit vector normal to the surface and 
on the surface, F ■ n is the normal component of the vector field with respect to the surface, we can write 
/ J s F ■ dS = J J F ■ n dS. The function F ■ n is the scalar projection of of F in the normal direction. Whereas 
the formula J J 1 dS gave the area of the surface with dS = \r u x r v \dudv, the flux integral weights each area 
element dS with the normal component of the vector field f(u,v) = (F(r(u,v)) ■ n(r(u,v)). 



BIKING IN THE RAIN. 

I experience (like last Monday) that during rain one gets soaked 
more by the rain when biking fast then by biking slow or walking 
foot. Why? For a biker who drives with speed V along the x axis, 
the rain is a fluid with velocity F = (—V, 0, W), where W is the 
speed of the rain drops falling along the z-axis. If the biker is a 
rectangular box [0, 1/2] x [0, 1] x [1/4, 1/3] we can calculate the flux 
through that surface. Parameterize the front by r(u, v) = (0, u, v) 
in the yz-p\ane. The water soaked up in unit time by the front 
clothes is the flux of the rain through that surface. We know r u x 
r v = (1, 0, 0) and / J s F ■ dS = f^ 2 ^ {-V, 0, W) -(1,0, 0) dudv = 
—V/2. The flux through the side surface is zero (if the rain falls 
vertically) and the flux through the top is constant, namely W/12. 
The total flux is W/12 — V/2. You catch more rain if you drive 
faster. 




PROPERTY ON THE MOON. More than two years ago, in lack of a better present 
idea, I bought property on the moon for family (For 30 Us-Dollars one could get 1 '777.58 
Acres)) at the Lunar Embassy (www.moonshop.com). An acre is now sold for 19.99 Dol- 
lars plus 1.51 Dollars lunar tax. You can also buy an entire "city" for 4500 dollars now. 



The coordinates of the parcel are 
34E,26N. The moon has a ra- 
dius of r=1737 km. The surface 
is 3.810 6 /cra 2 . Multiply this by 
247 gives about 10 billion (10 10 ) 
acres (more than an acre for each 
person on the world). The moon 
surface is parameterized by r(u, v) = 
(r cos(w) sin(v), r sin(w) sin(i>), r cos(v) 
where u is the latitude and tt/2 — v 
is the latitude. If we look at the 
moon from the direction (1,0,0), 
how big does the area of a real estate 
S = r([a,b] x [c,d]) appear? 




The flux of the vector field (1, 0, 0) through the surface is J J s F ■ dS = J a J c r 2 cos(w) sin 2 (v)dvdu. 

This is also the area we see. You notice that the visible area is small for cos(u) small or sin(i>) small, which is 

near 90E 1 , 90W or near the poles. 



EXAMPLE. Calculate the flux of the vector field F(x,y,z) = (l,2,3z) through the paraboloid z = x 2 + y 2 
lying above the region x 2 + y 2 < 1. 

Parameterize it using polar coordinates r(u,v) = (ucos(v),us'm(v),u 2 ) where r u x r v = 

(-2w 2 cos(v),-2w 2 sin(v),w) and F(r{u,v)) = (l,2,3w 2 ). We get J s F ■ dS = ^ ^ {-2u 2 cos{v) - 
4K 2 sin(t>) + 3u 3 )dudv = 1. 



WHERE CAN FLUX INTEGRALS APPEAR? 



• Fluid dynamics The flux of the velocity vector field of a membrane through a surface is the volume of 
the fluid which passes through that surface in unit time. Often one looks at incompressible fluids which 
means div(i>) = 0. In this case, the amount of fluid which passes through a closed surface like a sphere is 
zero because what enters also has toe leave the surface. 

• Electromagnetism The change of the flux of the magnetic field through a surface is related to the voltage 
at the boundary of the surface. We will see why one of the Maxwell equations div(B) = means that 
there are no magnetic monopoles and see how divergence is related to flux. 

• Magnetohydrodynamics Magnetohydrodynamics deals with electrically charged "fluids" like plasmas 
in the sun or solar winds etc. The velocity field of the wind induces a current and the flux of the field 
trough a surface can be used to calculate the magnetic field. 

• Gravity We will see that the flux of a field through a closed surface like the sphere is related to the 
amount of source inside the the surface. In gravity, where F is the gravitational field, the flux of the 
gravitational field through a closed surface is proportional to the mass inside the surface. Mass is a 
" source for gravitational field" . 

• Thermodynamics. The heat flux through a surface like the walls of a house are related to the amount 
of heat which is produced inside the house. A badly isolated house needs has a large heat flux and needs 
a large source of heating. 

• Particle physics. When studying the collision of particles, one is interested in the relation between 
the incoming flux and out-coming fluxes depending on the direction. The key word is: "differential 
cross-section" . 

• Optics. Measurements of "light fluxes" are important for lasers, astronomy, photography. 
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REMINDERS. The curl of a vector field F is 

curl(M, N,P) = V xF=(P y -N Z: M z -P X: N x - M y ) . 

The flux integral of a vector field F through a surface S = r(R) 
is defined as 




F(r(u,v)) ■ (r u x r v ) dudv 

The picture shows a tornado near Cordell, Ok- 
The line integral of a vector field F along a curve C = r([a,6]) is lahoma. Date: May 22, 1981. Photo Credit: 
given as NOAA Photo Library, NOAA Central Library. 

The tornado points into the direction of the 
field curl(F), where F is the velocity of the 
air. Twisters cause annually about 80 deaths 
in the US. The 17 Illinois twisters last Tuesday 
killed 8 people 



J F-dr = J F(r(t)) -r'(t) dt . 



STOKES THEOREM. Let S be a surface with boundary curve C and let F be a vector field. Then 



curl(F) -dS= / F-dr 



L 



Note: the orientation of C is such that if you walk along the surface (head into the direction of the normal 
r u x r v ), then the surface to your left. 

EXAMPLE. Let F(x, y, z) = (—y, x, 0) and let S be the upper semi hemisphere, then curl(F)(x, y, z) = (0, 0, 2). 
The surface is parameterized by r(u,v) = (cos(w) sin(i>), sin(w) sin(t>), cos(v )) on R = [0,27r] x [0,7r/2] and 
r u xr„ = s'm(v)r(u, v) so that cuil(F)(x, y, z)-r u xr v = cos(v) sin(v)2. The integral J^ 2 sin(2v) dvdu = 2ir. 

The boundary C of S is parameterized by r(t) = (cos(t), sin(t), 0) so that dr = r'(t)dt = (— sin(t), cos(t), 0)dt 
and F(r(t)) r'(t)dt = sin(t) 2 + cos 2 (£) = 1. The line integral J c F ■ dr along the boundary is 2tt. 



SPECIAL CASE: GREEN'S THEOREM. If S is a surface in 
the x — y plane and F = (M, N, 0) has zero z component, then 
curl(F) = (0, 0, N x - M y ) and curl(F) • dS = (N x - M y ) dxdy. 




PROOF OF STOKES THEOREM. 

For a surface which is flat, Stokes theorem can be seen with Green's theorem. If we put the coordinate 
axis so that the surface is in the #?/-plane, then the vector field F induces a vector field on the surface such 
that its 2D curl is the normal component of curl(F). The reason is that the third component N x — M y of 
curl(F) = (P y - N Z ,M Z - P x: N x - M y ) is the two dimensional curl: F(r(u, v)) ■ (0, 0, 1) = N x - M y . If C is 
the boundary of the surface, then J J s F(r(u,v)) ■ (0,0, 1) dudv = J c F(r(t)) r'(t)dt. 

For a general surface, we approximate the surface by a mesh of small parallelepipeds. When summing up line 
integrals along all these parallelepipeds, the lineintegrals inside the surface cancel and only the integral along 
the boundary remain. On the other hand, the sum of the fluxes of the curl through boundary adds up to the 
flux through the surface. 




George Gabriel Stokes 



Andre Marie Ampere 



EXAMPLE. Calculate the flux of the curl of F{x,y,z) = 
(—y,x,0) through the surface parameterized by r(u,v) = 
(cos(w) cos(v), sin(u) cos(v), cos 2 (v) + cos(v) sin 2 (w + 7r/2)). 
Because the surface has the same boundary as the upper half 
sphere, the integral is again 2tt as in the above example. 



For every surface bounded by C the flux 
of curl(F) through through the surface 
is the same. The flux of the curl of a 
vector field through a surface S depends 
only on the boundary of S. 




Compare this with the earlier statement: 



For every curve between two points 
A, B the line integral of grad(/) along 
C is the same. The line integral of the 
gradient of a function of a curve C de- 
pends only on the end points of C. 



BIOT-SAVARD LAW. A magnetic field B in absence of an electric field satisfies a Maxwell equation cm\{B) = 
(47r/c)j, where j is the current and c is the speed of light. How do we get the magnetic field B, when the 
current is known? Stokes theorem can give the answer: take a closed path C which bounds a surface S. 
The line integral of B along C is the flux of curl(£>) through the surface. By the Maxwell equation, this is 
proportional to the flux of j through that surface. Simple situation. Assume j is contained in a wire of thickness 
r which we align on the z-axis. To measure the magnetic field at distance R> r from the wire, we take a curve 
C : r(t) = (.Rcos(£), i?sin(£), 0) which bounds a disc S and measure 2ttRB = J c B ■ ds = J J s cur\(B) dS = 
J J s 47r/cj dS = 4ttJ/c, where J is the total current passing through the wire. The magnetic field satisfies 
B = 2J/(cR). This is called the Biot-Savard law. 



THE DYNAMO, FARADEY'S LAW. The electric field E and 
the magnetic field B are linked by a Maxwell equation cwc\(E) = 
— \B. Take a closed wire C which bounds a surface S and consider 
J j s B-dS, the flux of the magnetic field through S. Its change can 
be related with a voltage using Stokes theorem: d/dt J f s B-dS — 
J J S B ■ dS = J J s -c curl(E) • dS = -c J c E ds = U, where 
U is the voltage measured at the cut-up wire. It means that if 
we change the flux of the magnetic field through the wire, then 
this induces a voltage. The flux can be changed by changing the 
amount of the magnetic field but also by changing the direction. 
If we turn around a magnet around the wire, we get an electric 
voltage. That is what happens in a power- generator like an alter- 
nator in a car. In Dynamo implementations, the wire is turned 
inside a fixed magnet. 
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curl: curl(M, N, P) = (P y - N Zl M z - P x , N x - M y ). 
gradient: grad(/) = (f x J y J z ). 
Fundamental theorem of line integrals 

curve r(t) : [a,b] — >• C with boundary {r(a),r(b)}. 



J grad(/)(r(t)) ■ r'{t) dt = f(r(b)) - f(r(a)) 
Stokes theorem: 

surface r(u, v) : R — > S with boundary r(t) : [a, b] — > C 



I cw\(F)(r(u,v)) ■ {r u x r v )dudv = J F(r(t)) ■ r'(t) dt 



CONSERVATIVE FIELDS. F is conservative if is a gradient field F = Vf. The fundamental theorem of line 
integral implies that the line integral along closed curves is zero and that line integrals are path independent. 
We have also seen that F = grad(/) implies that F has zero curl. If we know that F is conservative, how do we 
compute /? If F — (M, N, P) = (f x , f y , f z ), we got / by integration. There is an other method which you do 
in the homework in two dimensions: to get the potential value /, find a path Cp from the origin to the point 
P = (x, y, z) and compute J c F ■ dr. Because line integrals are path independent, the fundamental theorem of 
line integrals gives J c F ■ dr = f(P). 



CONNECTED. A region is called connected if one can connect any two points in the region with a path. 

SIMPLY CONNECTED. A region is called simply connected if it is connected and every path in the region 
can be deformed to a point within the region. 

EXAMPLES 2D: 

>V O C 3» 



Simply connected. Not simply connected. Simply connected. Not simply connected. 



REMINDERS: 



CURL: curl(F) = VxF 



GRAD:|grad(/) = Vf 
FTL: 



/ c grad(/) -dr = f(B)-f(A) 



STOKES: 



J7 s curl(F) .dS = J c F-dr 



CURL(GRAD)=0: 



V x V/ = 



EXAMPLES 3D: 




9 o 



A solid ball is simply con- A solid torus is not simply The complement of a solid ^^q™ ^ & ^ 

nected. connected. ball is simply connected. oms / s no simp y con " 

nected. 



THEOREM. 



In a simply connected region D, a vector field F is conservative if 
and only if curl(F) = everywhere inside D. 



Proof. We already know that F = V/ implies curl(F) = 0. To 
show the converse, we verify that the line integral along any closed 
curve C in D is zero. This is equivalent to the path independence 
and allows the construction of the potential / with F = V f . 
By assumption, we can deform the curve to a point: if ro(t) is 
the original curve and r\ (t) is the curve n (t) = P which stays at 
one point, define a parametrized surface S by r(t, s) = r s (t). By 
assumption, curl(F) = and therefore the flux of curl(F) through 
S is zero. By Stokes theorem, the line integral along the boundary 
C of the surface S is zero too. 



THE NASH PROBLEM. Nash challenged his multivariable class in the movie "A beautiful mind" with a 
problem, where the region is not simply connected. 



Find a region X of R 3 with the property that if V is the set of 
vector fields F on R 3 \ X which satisfy curl(.F) = and W is the 
set of vector fields F which are conservative: F = V/. Then, the 
space V/W should be 8 dimensional. 



You solve this problem as an inclass exercise (ICE). The problem 
is to find a region D in space, in which one can find 8 different 
closed paths Ci so that for every choice of constants (ci,...,cs), 
one can find a vector field F which has zero curl in D and for 
which one has J c ^ F dr = c\ , . . . , J Cs F dr = c%. 

One of the many solutions is cut out 8 tori from space. For each 
torus, there is a vector field Fi (a vortex ring), which has its 
vorticity located inside the ring and such that the line integral of 
a path which winds once around the ring is 1. The vector field 
F = c\Fi + ... + cgF 8 has the required properties. 



CLOSED SURFACES. Surfaces with no boundaries are called closed surfaces. For example, the surface of a 
donought, or the surface of a sphere are closed surfaces. A half sphere is not closed, its boundary is a circle. 
Half a doughnut is not closed. Its boundary consists of two circles. 



THE ONE MILLION DOLLAR QUESTION. One of the Millenium problems is to determine whether any three 
dimensional space which is simply connected is deformable to a sphere. This is called the Poincare conjecture. 



LINEINTEGRAL IN HIGHER DIMENSIONS. Line integrals are defined in the same way in higher dimensions. 
j c F ■ dr, where • is the dot product in d dimensions and dr = r'(t)dt. 

CURL IN HIGHER DIMENSIONS. In d-dimensions, the curl is the field curl(F)^- = d X3 F i -d Xz F j with 

components. In 4 dimensions, it has 6 components. In 2 dimensions it has 1 component, in 3 dimensions, it 
has 3 components. 

SURFACE INTEGRAL IN HIGHER DIMENSIONS. In d dimensions, a surface element in the ij-plane is 
written as dSij. The flux integral of the curl of F through S is defined as j J curl(F) • dS, where the dot 
product is Yli<j cur\(F)ijdSij. If S is given by a map X from a domain R in the 2-plane to R d , U = d u X and 
V = d v X are tangent vectors to that plane and dSij(u,v) = (UiVj — UjVi) dudv. 

STOKES THEOREM IN HIGHER DIMENSIONS. If S is a two dimensional surface in d-dimensional space 
and C is its boundary, then J J s curl(F) ■ dS = J c F ■ ds. 





ICE: NASH'S PROBLEM 



Math 21a 



In this ICE, you solve Nash's problem, he gave to a multivariate calculus class. Remember 
Nash saying in the movie " A beautiful mind" : 



" This problem here will take some of you many months 
to solve, for others amoung you it might take a life time" . 



You will solve it here in 10 minutes ... 




NASH's PROBLEM. Find a subset X of R 3 with the property that if V is the set of vector 
fields F on R 3 \ X which satisfy curl(F) = and W is the set of vector fields F which are 
conservative: F = V/. Then, the space V/W should be 8 dimensional. 



Remark. The meaning of the last sentence means that there should be 8 vectorfields Fi which 
are not gradient fields and which have vanishing curl outside X. Furthermore, you should not 
be able to write any of the 8 vectorfields as a sum of multiples of the other 7 vector fields. 

Here is a two dimensional version of the problem: 
2D VERSION OF NASH's PROBLEM. 

If X = {0} and V is the set of vector fields F on R 2 \ X which satisfy curl(F) = and W is 
the set of vector fields F which are gradient fields, then dim(V/W) = 1. 

The vector field F is F(x,y) = (-y/(x 2 + y 2 ),x/(x 2 + y 2 )) has vanishing curl in R 2 \ X but 
the line integral around the origin is 2-7T. It is not a gradient field. 
Now: What set X would you have to take to get dim(V/ W) = 8? 



The SOLUTION OF THE 3D VERSION OF NASH's PROBLEM can be obtained directly 
from the solution of the 2D version. How? 



4/28/2004, CURL AND DIV 



Math21a, O. Knill 



CURL (3D). The curl of a 3D vector field F = 
(P, Q, R) is defined as the 3D vector field 

curl(P, Q, R) = {R y - Q z , P z -R Xl Q x - P y ) . 

CURL (2D). Recall, the curl of a 2D vector field 
F = (P, Q) is Q x — P y , a scalar field. 




WANTED! Is there a multivariable calculus book in which the above wheel is not shown? The wheel indicates 
the curl vector if F is thought of as a wind velocity field. As we will see later the direction in which the wheel 
turns fastest, is the direction of curl(P). The wheel could actually be used to measure the curl of the vector 
field at each point. In situations with large vorticity like in a tornado, one can "see" the direction of the curl. 



DIV (3D). The divergence of F = (P,Q,R) is the 
scalar field div(P, Q,R) = V • F = P x + Q y + R z . 
DIV (2D). The divergence of a vector field F = 
(P, Q) is div(P, Q) = V • F = P x + Q y . 



NABLA CALCULUS. With the "vector" V = {d x ,d y ,d z ), we can write curl(P) = V x F and div(P) = V • F. 
This is both true in 2D and 3D. 

LAPLACE OPERATOR. Af = divgrad(/) = f xx + f yy + f zz can be written as V 2 / because V ■ (V/) = 
div(grad(/). One can extend this to vectorfields AF = (AP, AQ, AR) and writes V 2 P. 



IDENTITIES. While direct computations can verify the identities to the left, they become evident with Nabla 
calculus from formulas for vectors like vxv = 0,v-vxw = or ux(vxw) = v(u ■ w) — (u ■ v)w. 



div(curl(P)) =0. 




V- V xF = 0. 


curlgrad(P) = 




V x VP = 0. 


curl(curl(P)) = grad(div(P) - A(P)). 




V x V x P = V(V • P) - (V • V)P. 



QUIZZ. Is there a vector field G such that P = (x + y,z,y 2 ) = curl(G)? Answer: no because div(P) = 1 is 
incompatible with divicurl(G)) = 0. 



ADDENDA TO GREEN'S THEOREM. Green's theorem, one of the advanced topics in this course is useful in 
physics. We have already seen the following applications 



• Simplify computation of double integrals 

• Simplify computation of line integrals 

• Formulas for centroid of region 

• Formulas for area 

• Justifying, why mechanical integrators like the planimeter works. 



APROPOS PLANIMETER. 

The cone planimeter is a mechanical instrument to find 
the antiderivative of a function f{x). It uses the fact that 
the vector field F(x,y) = (y,0) has curl(P) = — 1. By 
Greens theorem, the line integral around the type I region 
bounded by and the graph of f(x) in the counter clockwise 
direction is J* f(x) dx. The planimeter determines that 
line integral. 




THERMODYNAMICS. Gases or liquids are often described in a P — V diagram, where the volume in the 
x-axis and the pressure in the y axis. A periodic process like the pump in a refrigerator leads to closed curve 
7 : r(t) = (V(t),p(t)) in the V — p plane. The curve is parameterized by the time t. At a given time, the gas 
has volume V(t) and a pressure p(t). Consider the vector field F(V,p) = (p,0) and a closed curve 7. What is 
J^Fds? Writing it out, we get /*(p(t),0) 1 (V'(t),p'(t) dt = J*p(t)V'(t)dt = J*pdV. The curl of F(V,p) is -1. 
You see by Green's theorem the integral — J Q pdV is the area of the region enclosed by the curve. 

MAXWELL EQUATIONS (in homework you assume no current j = and charges p = 0. c = speed of light. 



div(P) = 


No monopoles 


there are no magnetic monopoles. 


curl(P) = -±P, 


Faraday's law 


change of magnetic flux induces voltage 


curl(P) = ±Et + *f j 


Ampere's law 


current or change of E produces magnetic field 


div(P) = 47rp 


Gauss law 


electric charges are sources for electric field 



2D MAXWELL EQUATIONS? In space dimensions d different than 3 the electromagnetic field has d{d + l)/2 
components. In 2D, the magnetic field is a scalar field and the electric field E = (P, Q) a vector field. The 2D 
Maxwell equations are curl(P) = — ^^P,div(P) = An p. Consider a region R bounded by a wire 7. Green's 
theorem tells us that d/dt j J R B(t) dxdy is the line integral of E around the boundary. But Eds is a voltage. 
A change of the magnetic field produces a voltage. This is the dynamo in 2D. We will see the real dynamo 
next week in 3D, where electromagnetism works (it would be difficult to generate a magnetic field in flatland). 
If v is the velocity distribution of a fluid in the plane, then uj(x, y) = cxn\(v)(x, y) is the vorticity of the fluid. 
The integral J J R cudxdy is called the vortex flux through R. Green's theorem assures that this flux is related 
to the circulation on the boundary. 



FLUID DYNAMICS, v velocity, p density of fluid. 



Continuity equation 


p + div(pv) = 


no fluid get lost 


Incompressibility 


div(v) = 


incompressible fluids, in 2D: v = grad(w) 


Irrotational 


curl(v) = 


irrotation fluids 



A RELATED THEOREM. If we rotate the vector field P = (P, Q) by 90 degrees = tt/2 we get a new vector 
field G = (— Q,P). The integral J^F ■ ds becomes a flux J^G ■ dn of G through the boundary of P, where 
dn is a normal vector with the length of dr. With div(P) = (P x + Q y ), we see that curl(P) = div(G). Green's 
theorem now becomes 

f f div(G) dxdy = f G ■ dn , 

J J R J-y 

where dn(x, y) is a normal vector at (x, y) orthogonal to the velocity vector r'(x, y) at (x, y). This new theorem 
has a generalization to three dimensions, where it is called Gauss theorem or divergence theorem. Don't treat 
this however as a different theorem in two dimensions. It is in two dimensions just Green's theorem 
disguised. There are only 2 basic integral theorems in the plane: Green's theorem and the FTLI. 



PREVIEW. Green's theorem is of the form f R F' = f SR F, where F' is a "derivative" and SR is a "boundary". 
There are 2 such theorems in dimensions 2, three theorems in dimensions 3, four in dimension 4 etc. In the 
plane, Green's theorem is the second one besides the fundamental theorem of line integrals FTLI. In three 
dimensions, there are two more theorems beside the FTLI: Stokes and Gauss Theorems which we will see in 
the next week. 



dim 


dim(R) 


theorem 


ID 


1 


Fund, thm of calculus 



2D 


1 


Fund, thm of line integrals 


2D 


2 


Green's theorem 



dim 


dim(R) 


theorem 


3D 


1 


Fundam. thm of line integrals 


3D 


2 


Stokes theorem 


3D 


3 


Gauss theorem 



1 ^ 1 




derivative 


1 ^ 2 




gradient 


2 ^ 1 


V x P 


curl 



1 ^ 3 


v; 


gradient 


3^3 


V x P 


curl 


3 ^ 1 


V-P 


divergence 



4/28/2004, DIVERGENCE THEOREM 



Math21a, O. Knill 



HOMEWORK. Section 13.8: 6,10,14,30 



DIV. The divergence of a vector field F is div(P, Q,R) = V • F = P x + Q y + R z . It is a scalar field. 

The flux integral of a vector field F through a surface S = r(R) was defined as J J s F ■ dS = J J R F(r(u, v)) ■ 

r u x r v dudv. 

Recall also that the integral of a scalar function / on a region is J J J R f dV = J J J G f(x, y, z) dxdydz. 



GAUSS THEOREM or DIVERGENCE THEOREM. Let G be a region in space bounded by a surface S and 
let F be a vector field. Then 



jjj G ^F )d V = jj s F. dS . 
The orientation of S is such that the normal vector r u x r v points outside of G. 



EXAMPLE. Let F(x,y,z) = (x,y,z) and let S be sphere. The divergence of F is constant 3 and 
J J J G div(-P) dV = 3 • 47r/3 = 47r. The flux through the boundary is J J s r ■ r u x r v dudv = 
J J s \r(u, v)\ 2 sin(v) dudv = J* J Q n sin(v) dudv = 4ir also. 

CONTINUITY EQUATION. If p is the density of a fluid and let v be the velocity field of the fluid, then 
by conservation of mass, the flux J J s v ■ dS of the fluid through a closed surface S bounding a region G is 
d/dtj J J G pdV, the change of mass inside G. But this flux is by Gauss theorem equal to J J f G div(v) dV. 
Therefore, J J J G p — div(i?) dV = 0. Taking a very small ball G around a point (x, y, z), where J J J G f dV ~ 
f(x, y, z) gives p — div(-u). This is called the continuity equation. 

EXAMPLE. What is the flux of the vector field F(x,y,z) = (2x,3z 2 + y,sin(x)) through the box G = [0,3] x 
[0,2] x [-1,1]? 

Answer: Use the divergence theorem: div(F) = 2 and so / / J G div(F) dV = 2 / / J G dV = 2Vol(G) = 24. 
Note: Often, it is easier to evaluate a three dimensional integral than a flux integral because the later needs a 
parameterization of the boundary, which requires the calculation of r u x r v etc. 



PROOF OF THE DIVERGENCE THEOREM. Consider a small box 
[x,x + dx] x [y,y + dy] x [z,z + dz]. Call the sides orthogonal to 
the x axis x-boundaries etc. The flux of F — (P, Q, R) through the 
x-boundaries is [F(x + dx, y, z) ■ (1, 0, 0) + F(x, y, z) ■ (—1, 0, 0)]dydz = 
P(x + dx, y, z) — P(x, y, z) = P x dxdydz. Similarly, the flux through 
the ^-boundaries is P y dydxdz and the flux through the z-boundary 
is P z dzdxdy. The total flux through the boundary of the box is 
(P x +P y + P z )dxdydz = div(F)dxdydz. 

For a general body, approximate it with a union of small little cubes. 
The sum of the fluxes over all the little cubes is sum of the fluxes through 
the sides which do not touch an other box (fluxes through touching sides 
cancel). The sum of all the infinitesimal fluxes of the cubes is the flux 
through the boundary of the union. The sum of all the div(F)dxdydz is 
a Riemann sum approximation for the integral J J J G div(F) dxdydz. In 
the limit, where dx,dy,dz goes to zero, we obtain Gauss theorem. 




VOLUME CALCULATION. Similarly as the planimeter allowed to calculate the area of a region by passing 
along the boundary, the volume of a region can be determined as a flux integral. Take for example the vector 
field F(x, y, z) = (x, 0, 0) which has divergence 1. The flux of this vector field through the boundary of a region 
is the volume of the region, j J §G (x, 0, 0) • dS = Vol(G). 



GRAVITY INSIDE THE EARTH. How much do we weight deep 
in earth at radius r from the center of the earth? (Relevant in 
the movie "The core") The law of gravity can be formulated as 
div(F) = 4irp, where p is the mass density. We assume that the 
earth is a ball of radius R. By rotational symmetry, the gravita- 
tional force is normal to the surface: F(x) = F(r)x/\\x\\. The flux 
of F through a ball of radius r is J J s F{x) ■ dS = 47rr 2 F(r). By 
the divergence theorem, this is AmM r — Att J J J B p(x) dV, 
where M r is the mass of the material inside S r . We have 
(4tt) V 3 /3 = 47rr 2 F(r) for r < R and {47r) 2 pR 3 /3 = 4yrr 2 F(r) for 
r > R. Inside the earth, the gravitational force F(r) = 47rpr/3. 
Outside the earth, it satisfies F(r) = M/r 2 with M = 4ttP 3 p/3. 




WHAT IS THE BOUNDARY OF A BOUNDARY? The fundamental theorem for lineintegral, Green's theorem, 
Stokes theorem and Gauss theorem are all of the form J A dF = J 5A F, where dF is a derivative of F and S A is 
a boundary of A. They all generalize the fundamental theorem of calculus. There is some similarity in how d 
and 5 behave: 



f scalar field 


ddf = curl grad(/) = 


S surface in space 


5S is union of closed curves 


55S = 


F vector field 


ddF = div curl(F) = 


G region in space 


5G is a closed surface 


55G = Q) 



The question when div(F) = 
We look at it Friday. 



implies F = curl(G) or whether curl(F) = implies G = grad(G) is interesting 



STOKES AND GAUSS. Stokes theorem was found by Ampere in 1825. George Gabriel 
Stokes: (1819-1903) was probably inspired by work of Green and rediscovers the identity around 
1840. Gauss theorem was discovered 1764 by Joseph Louis Lagrange. Carl Friedrich 
Gauss, who formulates also Greens theorem, rediscovers the divergence theorem in 1813. Green 
also rediscovers the divergence theorem in 1825 not knowing of the work of Gauss and Lagrange. 






Carl Friedrich Gauss George Gabriel Stokes Joseph Louis Lagrange Andre Marie Ampere 



GREEN IDENTITIES. If G is a region in space bounded by a surface S and /, g are scalar functions, then with 
Af = V 2 / = f xx + f yy + f zz , one has as a direct consequence of Gauss theorem the first and second Green 
identities (see homework) 



JJJ G (fAg + V/ • Vg) dV = J J s fVg ■ dS 



J J J G (fAg - gAf) dV = J J s (fVg - gVf) ■ dS 



These identities are useful in electrostatics. Example: if g = / and Af = and either / = on the boundary 
S or V/ is orthogonal to S, then Green's first identity gives J J j G |V/| 2 dV = which means / = 0. This can 
be used to prove uniqueness for the Poisson equation Ah = 4-irp when applying the identity to the difference 
/ = hi — hi of two solutions with either Dirichlet boundary conditions (h = on S) or von Neumann 
boundary conditions (Vh orthogonal to S). 



GAUSS THEOREM IN HIGHER DIMENSIONS. If G is a n-dimensional "hyperspace" bounded by a (n - 1) 
dimensional "hypersurface" S, then J G div(F)dV = J s F ■ dS. 

DIV. In dimension d, the divergence is defined div(F) = V • F = dFi/dxi. The proof of the n-dimensional 
divergence theorem is done as in three dimensions. 



By the way: Gauss theorem in two dimensions is just a version of Green's theorem. Replacing F = (P, Q) 
with G = (—Q,P) gives curl(F) = div(G) and the flux of G through a curve is the lineintegral of F along the 
curve. Green's theorem for F is identical to the 2D-divergence theorem for G. 



4/30/2004, DIVERGENCE THEOREM II 



Math21a, O. Knill 



Problems: 1,2,3,4,5 in Homework (see homework handout). 



ZWICKY'S VIEW. Fritz Zwicky was an astronomer who also devised the morphological method for creativ- 
ity. The method is one of many tools to solve problems which require new ideas. Zwicky was highly creative 
himself. The idea of the supernovae, dark matter and many pulsar discoveries can be attributed to him. In 
1948 for example, Zwicky suggested to use extraterrestrial sources to reconstruct the universe. This should 
begin with changing other planets, moons and asteroids by making them inhabitable and to change their orbits 
around the sun in order to adjust their temperature. He even suggested to alter the fusion of the sun to make 
displace our own space solar system towards an other planetary system. Zwickies morphological method is 
to organize ideas in boxes like a spreadsheet. For example, if we wanted to make order in the zoo of integral 
theorems we have seen now, we would one coordinate to display the dimension, in which we work and the second 
coordinate the maximal dimension along which we integrate in the theorem. 






1 


2 


3 


1 


Fundamental theorem of calculus 






2 


Fundamental theorem of line integrals 


Greens theorem 




3 


Fundamental theorem of line integrals 


Stokes theorem 


Gauss theorem 






Fundamental thm of line inte- 
grals. We integrate over 1 and 
dimensional objects. The di- 
mensional object has no bound- 
ary. 



Green and Stokes theorems. We 
integrate over 2 and 1 dimen- 
sional objects. The 1 dimen- 
sional object is a closed curve 
and has no boundary. 



Divergence theorem. We inte- 
grate over 3 and 2 dimensional 
objects. The 2 dimensional ob- 
ject is a closed surface and has 
no boundary. 



One could build now for each of the differential operators grad, curl and div such a matrix and see whether there 
is a theorem. Many combinations do not make sense like integrating the curl over a three dimensional object. 
But there is a theorem, which seem have escaped: if we integrate in two dimensions the div(i 7 ') = M x + N y over 
a region R, then there this can be written as an integral over the boundary. There is indeed such a theorem, 
but it is just a version of Greens theorem in disguise (turn the vector field by 90 degrees and replace the line by 
a 2D version of the flux integral). It can be seen as a special case of the divergence theorem in three dimensions 
and it does not make sense to put it on the footing of the other theorems. Anyway, the morphological method 
was used in management like in Ciba, it is today just one of a variety of methods to come up with new ideas. 



IDENTITIES. While direct computations can verify the identities to the left, they become evident with Nab la 
calculus from formulas for vectors like vxv = 0,v-vxw = or ux(vxw) = v(u ■ w) — (u ■ v)w. 



div(curl(F)) = 0. 




V- V x F = 0. 


curl(grad(F)) = 




V x VF = 0. 


curl(curl(F)) = grad(div(F) - A(F)). 




V x V x F = V(V • F) - (V • V)F. 



QUIZZ. Is there a vector field G such that F — (x + y, z, y 2 ) = curl(G)? Answer: no because div(F) = 1 is 
incompatible with divcurl(GQ) = 0. 



BOXED OVERVIEW. All integral theorems are of the form f R F' = J §R F, where F' is a "derivative" and 5R 
is a "boundary". There are 2 such theorems in dimensions 2, three theorems in dimensions 3, four in dimension 
4 etc. 



dim 


dim(R) 


theorem 


ID 


1 


Fund, thm of calculus 



2D 


1 


Fund, thm of line integrals 


2D 


2 


Green's theorem 



dim 


dim(R) 


theorem 


3D 


1 


Fundam. thm of line integrals 


3D 


2 


Stokes theorem 


3D 


3 


Divergence theorem 



1 ^ 1 




derivative 


1 ^ 2 


v/ 


gradient 


2 ^ 1 


V x F 


curl 



1 ^ 3 


v/ 


gradient 


3^3 


V x F 


curl 


3^1 


V F 


divergence 



MAXWELL EQUATIONS, c is the speed of light. 



div(B) = 


No monopoles 


there are no magnetic monopoles. 


curl(E) = -\B t 


Faraday's law 


change of magnetic flux induces voltage 


curl(B) = ±Et + 


Ampere's law 


current or change of E produces magnetic field 


div(E) = 47Tp 


Gauss law 


electric charges are sources for electric field 



MAGNETOSTATICS: curl(£) = so that B = grad(/). But since also div{B) = 0, we have 



Af = div(grad(/)) = 



ELECTROSTATICS. curl(E) = so that E = grad(/). But since also div(£) = 0, we have 



Af = div(grad(/)) = 



Static electric and magnetic fields have a harmonic potential. 



FLUID DYNAMICS, v velocity, p density of fluid. 



Continuity equation 


p + div(pv) = 


no fluid get lost 


Incompressibility 


div(v) = 


incompressible fluids 


Irrotational 


curl(v) = 


no vorticity 


Potential fluids 


v = grad(/),A(/)=0 


incompressible, irrotational fluids 



Incompressible, irrotational fluid velocity fields have a potential which is harmonic 



HARMONIC FUNCTIONS. A function f(x,y,z) is called harmonic, if Af(x,y,z) = 0. Harmonic functions 
play an important role in physics. For example time independent solutions of the Schrodinger equation ihft = 
Af or the heat equation f t = Af are harmonic. In fluid dynamics, incompressible and irrotational fluids 
with velocity distribution v satisfy v = grad(/) and since div(v) = 0, also Af = 0. Harmonic functions have 
properties which can be seen nicely using the divergence theorem. 



MAXIMUM PRINCIPLE. A harmonic func- 
tion in D can not have a local maximum inside 
D. 



MEAN VALUE PROPERTY. The average of a har- 
monic function on a sphere of radius r around a point 
P is equal to the value of the function at the point P. 



You deal with such problems in the homework. They are direct consequences of the divergence theorem. 



THE PIGEON PROBLEM. A farmer drives a closed van through 
the countryside. Inside the van are dozens of pigeons. The van 
weights 200 pounds, the pigeon, an other 100 pounds. The driver 
has to pass a bridge, which can only sustain 250 pounds. The 
driver has an idea: if all the pigeon would fly while crossing the 
bridge, a passage would be possible. Question: does the bridge 
hold? What happens if the cage is open? 

The flight of the pigeon produces a wind distribution in the cage. The question is whether the flying pigeon 
will still produces a weight on the van. 




4/28/2004, GRAVITY INSIDE EARTH 



Math21a, O. Knill 




Since the middle ages, 
people were interested in 
the gravity of earth. Orig- 
inally, the motivation was 
to figure out how life 
would be like in hell 
Nowadays, the inter- 
est has sparked by science 
fiction movies like "the 
core" , where people travel 
to the core of the earth. 




GRAVITY INSIDE THE EARTH. How much do we weight deep 
in earth at radius r from the center of the earth? (Relevant in the 
movie " The core" ) The law of gravity can be formulated as 



div(F) = 4irp 



where p is the mass density. 

We assume that the earth is a ball of radius R. By rotational 
symmetry, the gravitational force is normal to the surface: 

F(x) = F(r)x/\\x\\ . 

The flux of F through a ball of radius r is 

F(x) ■ dS = 47rr 2 F{r) 



IL 




By the divergence theorem, this is 4.nM r — 4n J J f B p{x) dV, 
where M r is the mass of the material inside S r . We have 
(4tt) V 3 /3 = 4irr 2 F{r) for r < R and {4>ir) 2 pR 3 /3 = 47rr 2 F(r) 
for r>R. 

Inside the earth, the gravitational force F(r) = 47rpr/3. Outside 
the earth, it satisfies F(r) = M/r 2 with M = 4ttR 3 p/3. 



We have seen above, how the gravitational 
field behaves inside a homogeneous body. This 
would have been a difficult computation with- 
out the divergence theorem. 
Since we know more about the interior of other 
stars then the interior of our earth, people still 
think about sending probes to the earth cen- 
ter. One of the first proposals was by Zwicky 
who wanted to use tunnels through the earth 
for travel purposes. One of the recent propos- 
als is to send liquid metal down a crack and 
let this propagate. 



seismic 
communication 




propagating 
crack 



embedded probe 




A nuclear explosion or earthquake might 
reveal the Earth's innards 



MICHAEL HOPKIN 

Abstract: Cracking idea gets to the core Astronomer says time is ripe for an unmanned journey to the centre 
of the Earth. 



We have sent probes to Mars, the outer planets and even beyond the farthest reaches of the Solar System - but 
never to the centre of our own world. Now a planetary scientist has a radical plan to redress the balance. 

Outer space is trillions upon trillions of times bigger than the Earth's interior, and yet we know more about 
what's out there than we do about what's under our feet, says David Stevenson of the California Institute of 
Technology in Pasadena. 

The Earth's core is "crammed with interesting stuff", he says. For example, swirling currents of liquid metal 
3,000-5,000 kilometers below the surface are thought to generate the planet's magnetic field. 

So Stevenson suggests we blast a crack in the Earth's surface and pour in thousands of tonnes of molten iron. 
The iron-filled crack would burrow downwards, closing up behind itself as it descends, and reaching the bowels 
of the planet in about a week. 

Concealed in this seething blob of iron would be a grapefruit-sized probe containing instruments to measure 
temperature, electrical conductivity and chemical composition. The device would send data as sound waves to 
a surface detector, which would filter the signal from the Earth's natural rumblings. 

Starting the crack would be the tricky part, says Stevenson. He calculates that it would take a blast equivalent 
to several megatonnes of TNT, an earthquake of magnitude 7 on the Richter scale, or a nuclear device such as 
those already possessed by many nations. 

Getting hold of the iron would also be an impressive logistical feat - Stevenson reckons that the experiment 
would need to commandeer all of the world's iron foundries for anything from an hour to a week. He is confi- 
dent, however, that the molten reservoir's sheer size would stop it solidifying before it is poured into the ground. 

It's an interesting proposition, comments Allan Rubin, who studies the Earth's structure at Princeton 
University in New Haven, Connecticut, but it does make some rather blithe assumptions. "We don't even know 
for definite that the lower mantle can fracture," he points out. 

Stevenson claims that the cost of his plan would be small beans compared with the riches lavished on space 
exploration. But he offers prospective investors no guarantee of success: "Frankly, I would be surprised if this 
really works," he admits. 



References 

1. Stevenson, D. J. Mission to Earth's core - a modest proposal. Nature, 423, 239 - 240, (2003). 



Source: Nature News Service / Macmillan Magazines Ltd, 2003 



REVIEW BEYOND VECTOR FIELDS 



Math21a,O.Knill 



INTEGRATION. 

Line integral: J c F ■ ds = J b F(r(t)) ■ r'{t) dt 

Surface integral J J s fdS = J a f(r(u,v))\r u (u,v) x r v (u,v)\ dudv 

Flux integral: J J s F ■ dS = J b F(r(u, v)) ■ r u (u, v) x r v (u, v) dudv 

Double integral: J J R f dA = j a J c f(x,y) dxdy. 



Triple integral: 



SSS R fdV = Si S d c SI /(*,», z) dxdydz. 



Area J J R ldA 
I I R 1 dxdy 
Length J b \r'(t)\ dt 
Surface area J J IdS - 

J J \r u x r v \ dudv 
Volume / / f B l dV ■- 
J J J B I dxdydz 



DIFFERENTIATION. 
Derivative: 
Partial derivative: 
Gradient: 
Curl in 2D: 
Curl in 3D: 
Div: 



/'(*) = f(t) = d/dtf(t). 

f x (x,y,z) = %L(x,y,z). 

grad(/) = (f x J y J 2 ) 

curl(F) = curl((M, N)) = N X -M y 

curl(F) = curl(M, N, P) = (P y -N Z ,M Z 

div(F) = div(M, N, P) = M X + N y + P z . 



-P x ,N x ~My) 



IDENTITIES. 

div(curl(F)) = 

curl(grad(/)) 

(0,0,0) 

div(grad(/)) = A/. 



NOTIONSS OF CONSERVATIVE. 1) Gradient: F = grad(/). 

2) Closed curve property: J c F ■ dr = for any closed curve. 

3) Conservative: Ci paths from A to B, then J c ^ F ■ dr = J c ^ F ■ dr. 

4) Mixed derivative property: curl(F) = in simply connected regions. 



Closed loop 
property 



TOPOLOGY. 1) Interior of region D: points which have a neighborhood contained in D. 

2) Boundary of a curve: endpoints of curve, boundary of 2D region D: curves which bound the region, 
boundary of a solid D: surfaces which bound the solid. 

3) Simply connected region D: a closed curve in D can be deformed within the interior of D to a point. 

4) Closed curve Curve without boundary. 

5) Closed surface surface without boundary. 

LINE INTEGRAL THEOREM. If C : r(t) = (x(t),y(t), z(t)),t e [a,b] is a curve and / is a function either in 
3D or the plane. Then 



/ Vf-ds = f(r(b))-f(r(a)) 
Jc 



CONSEQUENCES. 

1) For closed curves the line integral J c V/ • ds is zero. 

2) Gradient fields are conservative: if F = V/, then the line integral between two points P and Q is path 
independent. 

3) The theorem holds in any dimension. In one dimension, it reduces to the fundamental theorem of calculus 

J b a f(x)dx = f(b)-f(a) 

4) The theorem justifies the name conservative for gradient vector fields. 

5) The term "potential" was coined by George Greeen (1783-1841). 

PROBLEM. Let f(x, y, z) = x 2 + y 4 + z. Find the line integral of the vector field F(x, y, z) = V/(x, y, z) along 
the path r(t) = (cos(5£), sin(2£), t 2 ) from t = to t = 2tt. 

SOLUTION. r(0) = (1,0,0) and r{2n) = (1,0, 4vr 2 ) and /(r(0)) = 1 and /(r(2vr)) = 1 + 4tt 2 . FTLI gives 
J c Vf ds = f(r(27r))-f(r(0)) = 47r 2 . 



GREEN'S THEOREM. If R is a region with boundary C and F = (M, N) is a vector field, then 



curl(F) dA= F-ds 



L 



REMARKS. 

1) Useful to swap 2D integrals to ID integrals or the other way round. 

2) The curve is oriented in such a way that the region is to your left. 

3) The region has to has piecewise smooth boundaries (i.e. it should not look like the Mandelbrot set). 

4) If C : r(t) = (x(t),y(t)), the line integral is f b {M{x{t),y{t)), N(x(t),y(t)) ■ (x'(t), y'{t)) dt. 

5) Green's theorem was found by George Green (1793-1841) in 1827 and by Mikhail Ostrogradski (1801-1862). 

6) If curl(F) = in a simply connected region, then the line integral along a closed curve is zero. If two curves 
connect two points then the line integral along those curves agrees. 

7) Taking F(x,y) = (—y,0) or F(x,y) = (0,x) gives area formulas. 



PROBLEM. Find the line integral of the vector field F(x,y) = (x 4 + sin(ar) + y,x + y 3 ) along the path 
r{t) = (cos(t), 5 sin(t) + log(l + sin(t))), where t runs from t = to t = it. 

SOLUTION. curl(F) = implies that the line integral depends only on the end points (0,1), (0,-1) of the 
path. Take the simpler path r(t) = (— t, 0),£ = [—1, 1], which has velocity r'{t) = (—1,0). The line integral is 
J^{t 4 - sin(t), -t) ■ (-1,0) dt = -£ 5 /5|li = -2/5. 

REMARK. We could also find a potential f(x,y) = x 5 /5 — cos(x) + xy + ?/ 5 /4. It has the property that 
grad(/) = F. Again, we get /(0, -1) - /(0, 1) = -1/5 - 1/5 = -2/5. 



STOKES THEOREM. If S is a surface in space with boundary C and F is a vector field, then 



/ f curl(F) • dS = f F-ds 
J Js Jc 



REMARKS. 

1) Stokes theorem implies Greens theorem if F is z independent and S is contained in the z-plane. 

2) The orientation of C is such that if you walk along C and have your head in the direction, where the normal 
vector r u x r v of S, then the surface to your left. 

3) Stokes theorem was found by Andre Ampere (1775-1836) in 1825 and rediscovered by George Stokes (1819- 
1903). 4) The flux of the curl of a vector field does not depend on the surface S, only on the boundary of S. 
This is analogue to the fact that the line integral of a gradient field only depends on the end points of the curve. 
5) The flux of the curl through a closed surface like the sphere is zero: the boundary of such a surface is empty. 



PROBLEM. Compute the line integral of F{x,y,z) = (x 3 + xy,y,z) along the polygonal path C connecting 
the points (0, 0, 0), (2, 0, 0), (2, 1, 0), (0, 1, 0). 

SOLUTION. The path C bounds a surface S : r(u,v) = (u,v,0) parameterized by R = [0,2] x [0,1]. By 
Stokes theorem, the line integral is equal to the flux of curl(F)(x, y, z) = (0,0, —x) through S. The normal 
vector of S is r u x r v = (1, 0, 0) x (0, 1, 0) = (0, 0, 1) so that / f s curl(F) dS = J Q 2 /^(0, 0, -u) ■ (0, 0, 1) dudv = 
Jo Jq-u dudv = -2. 



GAUSS THEOREM. If S is the boundary of a region B in space with boundary S and F is a vector field, then 



Jff B wr = fJ 



F-dS 

s 



REMARKS. 

1) Gauss theorem is also called divergence theorem. 

2) Gauss theorem can be helpful to determine the flux of vector fields through surfaces. 

3) Gauss theorem was discovered in 1764 by Joseph Louis Lagrange (1736-1813), later it was rediscovered by 
Carl Friedrich Gauss (1777-1855) and by George Green. 

4) For divergence free vector fields F, the flux through a closed surface is zero. Such fields F are also called 
incompressible or source free. 



PROBLEM. Compute the flux of the vector field F(x, y, z) = (—x, y, z 2 ) through the boundary S of the 
rectangular box [0,3] x [-1, 2] x [1, 2]. 

SOLUTION. By Gauss theorem, the flux is equal to the triple integral of div(i 7 ') = 2z over the box: 
J 3 J 2 _ x Jl 2z dxdydz = (3 - 0)(2 - (-1))(4 - 1) = 27. 



5/3/2004, INTEGRAL THEOREMS OVERVIEW 



Math21a,O.Knill 



HOMEWORK: Problems 6,7,8 in additional homework handout. 



SUMMARY. The FTLI, Green, Stokes and Gauss theorem generalize the fundamental theorem of calculus and 
are all of the form 



IdR F = Ir ^ ' w here dR is the boundary of R and dF is a derivative of F. 



INTEGRATION. 

Line integral: J c F ■ ds = J* F(r(t)) ■ r'(t) dt 

Surface integral J J s fdS = J a J c f(r(u,v))\r u (u,v) x r v (u,v)\ dudv 

Flux integral: J J s F ■ dS = f a f^ F(r(u,v)) -r u (u,v) x r v (u,v) dudv 

Double integral: J J R f dA = J a J c f(x,y) dxdy. 

Triple integral: / / J R f dV = $1 ^ So f(x, y, z) dxdydz. 



Area J J R ldA 
I Ir 1 dxdy 

Surface area J J IdS - 

J J \t u x r v\ dudv 
Volume J J J B 1 dV - 
JJ/b 1 dxdydz 



DIFFERENTIATION. 
Derivative: 
Partial derivative: 
Gradient: 
Curl in 2D: 
Curl in 3D: 
Div: 



f(t) = f(t) = d/dtf(t). IDENTITIES. 

fx(x, y, z) = %t{x, y, z). div(curl(F)) = 

grad(/) = (f x , f y , f z ) curl(grad(/)) 

curl(F) = curl((M, N)) = N x - M y ( , , 0) 
curl(F) = curl(M, TV, P) = (P y - N Z1 M Z -P X1 N X - M y ) 
div(F) = div(M, N, P) = M x + N y + P z . 



LINE INTEGRAL THEOREM. If C : r{t) = (x(t),y(t), z(t)),t e [a,b] is a curve and / is a function either in 
3D or the plane. Then 



/ Vf-ds = f(r(b))-f(r(aj) 



CONSEQUENCES. 

1) If the curve is closed, then the line integral J V/ • ds is zero. 

2) If F = V/, the line integral between two points P and Q does not depend on the chosen path. 



REMARKS. 

1) The theorem holds in any dimension. In one dimension, it 
reduces to the fundamental theorem of calculus 



f a f(x) dx = f(b)-f(a) 



2) The theorem justifies the name conservative for gradient vector fields. 

3) In physics, / is the potential energy and V/ a force. The theorem says that such forces lead to energy 
conservation. 

4) The term "potential" was coined by George Greeen (1783-1841). 



PROBLEM. Let f(x, y, z) = x 2 + y 4 + z. Find the line integral of the vector field F(x, y, z) = Vf(x, y, z) along 
the path r(t) = (cos(5£), sin(2£), t 2 ) from t = to t = 2tt. 

SOLUTION. r(0) = (1,0,0) and r(27r) = (1,0, 4tt 2 ) and /(r(0)) = 1 and /(r(27r)) = 1 + 4tt 2 . Applying the 
fundamental theorem gives J V/ ds = /(r(2-7r)) — /(r(0)) = 4tt 2 . 



GREEN'S THEOREM. If R is a region with boundary 7 and F = (M, iV) is a vector field, then 



J J curl(F) dA = J F 



ds 



REMARKS. 

1) Useful to swap 2D integrals to ID integrals or the other way round. 

2) The curve is oriented in such a way that the region is to your left. 

3) The region has to has piecewise smooth boundaries (i.e. it should not look like the Mandelbrot set). 

4) If 7 : 1 1-> r(t) = (x(t),y(t)), the line integral is f*(M(x(t),y(t)),N(x(t),y(t)) ■ (x'(t),y'(t)) dt. 

5) Green's theorem was found by George Green (1793-1841) in 1827 and by Mikhail Ostrogradski (1801-1862). 



CONSEQUENCES. 

1) If curl(F) = in a simply connected region, then the line integral along a closed curve is zero. If two curves 
connect two points then the line integral along those curves agrees. 

2) Taking F(x,y) = (—y,0) gives an area formula Area(R) = J —y dx. Similarly Area(^4) = J xdy. 



PROBLEM. Find the line integral of the vector field F(x,y) = (x 4 + sin(x) + y,x + y 3 ) along the path 
r(t) = (cos(t), 5 sin(t) + log(l + sin(t))), where t runs from t = to t = ir. 

SOLUTION. curl(F) = implies that the line integral depends only on the end points (0,1), (0,-1) of the 
path. Take the simpler path r(t) = (— £,0),£ = [—1, 1], which has velocity r'(t) = (—1,0). The line integral is 
J^{t 4 - sin(t), -t) • (-1,0) dt = -£ 5 /5|!i = -2/5. 

REMARK. We could also find a potential f(x,y) = x 5 /5 — cos(ic) + xy + ?/ 5 /4. It has the property that 
grad(/) = F. Again, we get /(0, -1) - /(0, 1) = -1/5 - 1/5 = —2/5. 



STOKES THEOREM. If £ is a surface in space with boundary 7 and F is a vector field, then 



J J curl(F) -dS = J F-ds 



REMARKS. 

1) Stokes theorem reduces to Greens theorem if F is z independent and S is contained in the z-plane. 

2) The orientation of 7 is such that if you walk along 7 and have your head in the direction, where the normal 
vector r u x r v of S points, then you have the surface to your left. 

3) Stokes theorem was found by Andre Ampere (1775-1836) in 1825 and rediscovered by George Stokes (1819- 
1903). 



CONSEQUENCES. 

1) The flux of the curl of a vector field does not depend on the surface S, only on the boundary of S. This is 
analogue to the fact that the line integral of a gradient field only depends on the end points of the curve. 

2) The flux of the curl through a closed surface like the sphere is zero: the boundary of such a surface is empty. 



PROBLEM. Compute the line integral of F(x, y, z) = (x 3 + xy, y, z) along the polygonal path 7 connecting the 
points (0, 0, 0), (2, 0, 0), (2, 1, 0), (0, 1, 0). 

SOLUTION. The path 7 bounds a surface S : r(u,v) = (u,v,0) parameterized by R = [0,2] x [0,1]. By 
Stokes theorem, the line integral is equal to the flux of cui\(F)(x,y,z) = (0,0, —x) through S. The normal 
vector of S is r u x r v = (1, 0, 0) x (0, 1, 0) = (0, 0, 1) so that / J s curl(F) dS = J Q 2 /^(0, 0, -u) ■ (0, 0, 1) dudv = 
Jq Jq-u dudv = -2. 



GAUSS THEOREM. If S is the boundary of a region B in space with boundary S and F is a vector field, then 



1 1 L* v(F)dv =l L FdS 



REMARKS. 

1) Gauss theorem is also called divergence theorem. 

2) Gauss theorem can be helpful to determine the flux of vector fields through surfaces. 

3) Gauss theorem was discovered in 1764 by Joseph Louis Lagrange (1736-1813), later it was rediscovered by 
Carl Friedrich Gauss (1777-1855) and by George Green. 



CONSEQUENCE. 

For divergence free vector fields F, the flux through a closed surface is zero. Such fields F are also called 
incompressible or source free. 



PROBLEM. Compute the flux of the vector field F(x, y, z) = (—x, y, z 2 ) through the boundary S of the 
rectangular box [0,3] x [-1, 2] x [1, 2]. 

SOLUTION. By Gauss theorem, the flux is equal to the triple integral of div(i 7 ') = 2z over the box: 
J 3 5 2 _ x ft 2z dxdydz = (3 - 0)(2 - (-1))(4 - 1) = 27. 



5/3/2003, INTEGRAL THEOREMS: PEOPLE & GOSSIP Math21a, O. Knill 



AMPERE. Andre Marie Ampere (1775-1836) made contributions to the theory 
of electricity and magnetism. Stokes theorem was found by Ampere in 1825. 



Gossip: 



• While still only 13 years old Ampere submitted his first paper to the Academie de Lyon. Amperes father 
died during the French Revolution. He went to the guillotine writing to Ampere's mother from his cell: 
"I desire my death to be the seal of a general reconciliation between all our brothers; I pardon those who 
rejoice in it, those who provoked it, and those who ordered it.... " 

• Ampre had a difficult time with his daughter. She married one of Napoleon's lieutenants who was an 
alcoholic. The marriage soon was in trouble. Ampere's daughter fled to her father's house and Ampere 
allowed later her husband to live with him also. This proved a difficult situation, led to police intervention 
and much unhappiness for Ampere. 



GAUSS. Carl Friedrich Gauss (1777-1855) worked in a wide variety of fields 
both in mathematics and physics like number theory, analysis, differential ge- 
ometry, geodesy, magnetism, astronomy and optics. Gauss theorem was dis- 
covered 1764 by Joseph Louis Lagrange. 




Gossip: 



• At the age of seven, while starting elementary school his potential was noticed almost immediately. His 
teacher, Biittner, and his assistant, Martin Bartels, were amazed when Gauss summed up the integers 
from 1 to 100 instantly by spotting that the sum was 50 pairs of numbers each pair summing to 101. 

• It seems that Gauss knew already about non-Euclidean geometry at the age of 15. 

• Gauss married twice. Both wives died, the first from the birth of a child, the second after a long illness. 



GREEN. George Green (1793-1841) discovered Green's theorem in 1827. Green 
also rediscovers the divergence theorem in 1825 not knowing of the work of 
Gauss and Lagrange. 



Gossip: 



• Green studied mathematics on the top floor of his fathers mill, entirely on his own. The years between 
1823 and 1828 were not easy for Green: As well as having a full time job in the mill, two daughters were 
born, the one in 1824 mentioned above, and a second in 1827. Between these two events his mother had 
died in 1825 and his father was to die in 1829. Yet despite the difficult circumstances and despite his 
flimsy mathematical background, Green published one of the most important mathematical works of all 
time in 1828. 

• Green was the son of a baker: from a short obituary in the Nottingham Review which showed they knew 
little of his life and less of the importance of his work. "... Through Thomson, Maxwell, and others, the 
general mathematical theory of potential developed by an obscure, self-taught millers son would lead to 
the mathematical theories of electricity underlying twentieth-century industry. 



LAGRANGE. Joseph-Louis Lagrange (1736-1813) worked in all fields of anal- 
ysis and number theory and analytical and celestial mechanics. Gauss theorem 
was discovered in 1764 by Joseph Louis Lagrange. 



Gossip: 



Lagrange is usually considered a French mathematician. But some refer to him as an Italian mathematician 
because Lagrange was born in Turin. Only the intervention of Lavoisier saved Lagrange during the French 
revolution from the guillotine because a law was established arresting all foreign born people. Lavoisier 
himself was guillotined shortly after. 

Lagrange first wife was his cousin. They had no children. In fact, Lagrange had told d'Alembert in this 
letter that he did not wish to have children. His first wife died after years of illness. 

Napoleon named Lagrange to the Legion of Honor and Count of the Empire in 1808. On April 3, 1813 
Lagrange was named "grand croix of the Ordre Imperial de la Reunion". He died a week later. 



OSTROGRADSKI. Mikhail Vasilevich Ostrogradski (1801-1862) worked on in- 
tegral calculus, mathematical physics like hydrodynamics and partial differen- 
tial equations. He worked also in algebra and differential equations, theoretical 
mechanics and wrote several textbooks. 




Gossip: 



• Ostrogradski wanted actually to pursue a military carrier. Financial considerations however led him to 
a career in the civil service which needed a university education. He studied physics and mathematics. 
He successfully finished his exams but the minister of religious affairs and national education refused to 
confirm the decision. Ostrogradski therefore never got a degree. 

• He left Russia to study in Paris, between 1822 and 1827 Ostrogradski attended lectures by Laplace, 
Fourier, Legendre, Poisson and Cauchy. 



STOKES. George Gabriel Stokes (1819-1903) established the science of hydro- 
dynamics with his law of viscosity describing the velocity of a small sphere 
through a viscous fluid. Stokes discovery of Stokes theorem (around 1840) was 
probably inspired by work of Green. 



Gossip: 



• In 1857 Stokes had to give up his fellowship at Pembroke College because fellows at Cambridge had then 
to be unmarried. Later, after a change in the rules in 1862 Stokes was able to take up the fellowship at 
Pembroke again. 

• While Stokes became engaged, he used to express his feelings to his bride Mary Susanna Robinson in 
rather mathematical terms: "I too feel that I have been thinking too much of late, but in a different way, 
my head running on divergent series, the discontinuity of arbitrary constants ... ". Such words did not 
reveal the love that Mary hoped to find in them and when Stokes wrote her a 55 page (!) letter about the 
duty he felt towards her, she came close to calling off the wedding. 



SOURCES. The sources include E.T. Whittaker's A History of the Theories of Ether and Electricity and the 
MacTutor History of Mathematics http://turnbull.dcs.st-and.ac.uk/history/. 



5/5/2004 INTEGRAL THEOREM PROBLEMS 



Math21a, O. Knill 



HOMEWORK. Problems 9,10,11 on the homeworksheet. 



SUMMARY. This is a collection of problems on line integrals, Green's theorem, Stokes theorem and the diver- 
gence theorem. Some of them are more challenging. 



LINE INTEGRALS GREEN THEOREM. 

The curve r(t) = (cos 3 (£), sin 3 (£)) is called a hypocycloid. It bounds a 
region R in the plane. 

a) Calculate the line integral of the vector field F(x,y) = (x,y) along 
the curve. 

b) Find the area of the hypocycloid. 

a) Because curl(F) = the result is zero by Green's theorem. 

b) Use the vector field F(x,y) = (0,x) which has curl(F) = 1. The 
line integral is F(r(t)) ■ r'{t) dt = cos 3 (£)3 sin 2 (t) cos(t) dt 
= J Q 27r 3cos 4 (£) sin 2 (£) dt = 37r/8. (To compute the integral, use that 
8cos 4 (£)sin 2 (£) = cos(2£) sin 2 (2£) + sin 2 (2£)). 




LENGTH OF CURVE AND LINE INTEGRALS. 

Assume C : t h+ r{t) is a closed path in space and F(r(t)) is the unit 
tangent vector to the curve (that is a vector parallel to the velocity 
vector which has length 1). 

a) What is J c F dr? 

b) Can F be a gradient field? 

Answer: 

a) F(r(t)) = r'(t)/\r'(t)\. By definition of the line integral, 

J F(r(t)) ■ r'{t) dt = £ • r'(t) dt = £ \r'(t)\ dt , 

which is the length of the curve. 

b) No: If F were a gradient field, then by the fundamental theorem of 
line integrals, we would have that the line integral along a closed curve 
is zero. But because this is the length of the curve, this is not possible. 




SURFACE AREA AND FLUX. 

Assume S : (u, v) h+ r(u, v) is a closed surface in space and F(r(u, v)) is 
the unit normal vector on S (which points in the direction of r u x r v ). 

a) What is J J g F ■ dS? 

b) Is it possible that F is the curl of an other vector field? 

c) Is it possible that div(F) = (0, 0, 0) everywhere inside the surface. 



Answer: 

a) F(r(u,v)) = (r u x r v )/\r u 
Jf s F.dS = ff R F(r(u,v))- 



x r v \. By definition of the flux integral, 
■ u x r v = J J R (r u xr v /\r u xr v \)-r u xr v = 
I Ir \ r u x r v \ dudv which is the area of the surface. 

b) No, if F were the curl of an other field G, then the flux of F through 
the closed surface would be zero. But since it is the area, this is not 
possible. 

c) From the divergence theorem follows that div(P) is nonzero some- 
where inside the surface. 




GREENS THEOREM AND LAPLACIAN. 



Assume R is a region in the plane and let n denote the unit normal 
vector to the boundary C of R. For any function u(x,y), we use the 
notation df/du = grad(n) • n which is the directional derivative of u into 
the direction n normal to C. We also use the notation Aw = u xx + u vv . 



Show that 



L 



'n • dr = 



I J. 



Au dA 



Answer: Define F(x,y) = (-B,A) \idu/dn = (A,B). The left integral 
is the line integral of F along C. The right integral is the double integral 
over Au = curl(.F). 



STOKES THEOREM OR DIVERGENCE THEOREM 

Find / f s curl(F) • dS, where S is the ellipsoid x 2 + y 2 + 2z 2 = 10 and 

F(x, y, z) = (sm(xy), e x , —yz). 

Answer. The integral is zero because the boundary of S is empty. This 
fact can be seen using Stokes theorem. It can also been seen by diver- 
gence theorem 

J J curl(F) -dS = J J J divcurl(F) dV . 

using div(curl) (_F) = 0. 



AREA OF POLYGONS. 

If Pi = (xi,yi),i = 1, . . . , n are the edges of a polygon in the plane, then 
its area is A = Y^i( x i ~ x i+ i){y i+ i + yi)/2. 

The proof is an application of Green's theorem. The line integral 
of the vector field F(x,y) = (—y,0) through the side P^P^+i is 
(xi — Xi + i)(yi + i + yi)/2, because — X{) is the projected area onto 

the x-axis and (yi+i + y%)/2 is the average value of the vector field on 
that side. Because cuil(F)(x,y) = 1 for all (x,y), the result follows 
from Greens theorem. 

The result can also be seen geometrically: (xi — Xi+±)(yi+i +yi)/2 is the 
signed area of the trapezoid (xi, 0), (xi + i, 0), (a^+i, yi+i)- In the picture, 
we see two of them. The second one is taken negatively. 



VOLUME OF POLYHEDRA. 

Verify with the divergence theorem: If Pi = (xi,yi,Zi) are the edges 
of a polyhedron in space and Fj = {Pi ± , ■■■P% k . } are the faces, then 
V = AjZj where Aj is the area of the ^-projection (*) of the 
polygon Fj and Zj = (z^ + ... + Zk^/kj is the average z value of the 
face Fj. 

Solution. The vector field F(x,y,z) = z has divergence 1. The flux 
through a face F is \Fj\{z^ + ... + Zk )/kj. Gauss theorem assures that 
the volume is the sum of the fluxes AjZj through the faces. 

(*) The projection of a polygon is the "shadow" when projecting from space 
along the z-axes onto the xy-plane. A triangle (1, 0, 1), (1, 1, 0), (0, 1, 2) for 
example would be projected to the triangle (1,0), (1, 1), (0, 1). 



STOKES AND GAUSS TOGETHER. 



Can you derive div(curl(F)) = using Gauss and Stokes theorem? 
Consider a sphere S of radius r around a point (x,y,z). It bounds a 
ball G. Consider a vector field F. The flux of curl(F) through S is 
zero because of Stokes theorem. Gauss theorem tells that the integral of 
/ = div(curl(F)) over G is zero. Because S was arbitrary, / must vanish 
everywhere. 



FUNDAMENTAL THEOREM AND STOKES. 

Can you derive the identity curl(grad(F)) = from integral theorems? 

To see that the vector field G = curl(grad(F)) = is identically zero, it 
is enough to show that the flux of G trough any surface S is zero. By 
Stokes theorem, the flux trough S is j c grad(F) -dr. By the fundamental 
theorem of line integrals, this is zero. 



VOLUME COMPUTATION WITH GAUSS. 

Calculate the volume of the torus T(a, b) enclosed by the surface 
r(u,v) = ((a + 6cos(v)) cos(w), (a + 6cos(v)) sin(w), 6sin(v)) using Gauss 
theorem and the vector field F(x, y, z) = (x, y, 0)/2. 

The vector field F has divergence 1. The parameterization of the torus 
gives 

r u x r v = b(a + 6cos(v))(cos(w) cos(i>), cos(v) sin(w), sin(u)) . 

The flux of this vectorfield through the boundary of the torus is 
lo* lo^ b( a + bcos(v)) 2 cos(v) dudv = 2ir 2 ab 2 . 



GAUSS OR STOKES? 



You know that the flux of the vector field G = curl(F)(x, y, z) through 
5 faces of a cube D is equal to 1 each. What is the flux of the same 
vector field G through the 6'th face? 

Solution: the problem is best solved with the divergence theorem: be- 
cause the flux of G through the entire surface is zero, the flux through 
the 6'th face must cancel the sum of the fluxes 5 through the other 5 
surfaces. The result is —5. 



WORK COMPUTATION USING STOKES. 



Calculate the work of the vector field F(x,y,z) = (x — y + z,y — 
z + x, z — x + y), along the path C which connects the points 
(1, 0, 0) -»• (0, 1, 0) -> (0, 0, 1) -+(1,0, 0) in that order. 

Answer. The line integral over each part is each 1. The total is 3. 
curl(F) = (2, 2, 2) and S : (u, v) ^ r(u, v) = (u, v, 1 — u — v) r u x r v = 
(1, 1,1) j s curl(F) • r u x r v dudv = 6 area ofS = 3, 



STOKES OR GAUSS? 

Compute the flux of the vector field F(x, y, z) = (x — 
xsm(s'm(z)),2y,3z + s'm(s'm(z))) through the upper hemisphere S = 
{ (x, y, z)\ x 2 + y 2 + z 2 = 1, z > 0}. 

Answer. We use Gauss: div(F) = 6 and / / J B div(F) dV = QVol(B) = 
47r. We can not easily compute the flux through the hemisphere. How- 
ever, we can see that the flux through the floor of the region is zero 
because the normal component P of of the vector field F = (M, N, P) 
is zero on z = 0. So: the result is 4tt — = 4tt. 



GREENS THEOREM. 

Calculate the work of the vector field F(x,y) = x2 ]_ y 2 {—y, x) along the 
boundary of the ellipse r(t) = (3cos(t) + sin(t), 5sin(£) + cos(t)). 

Solution. Take an other curve C : x 2 -\-y 2 < 4 and apply Green's theorem 
to the region R bounded by the ellipse and the circle. Because curl(F) 
is zero in D, the line integral along the ellipse is the same as the line 
integral along the circle: t ^ r(t) = (2 cos(t), 2 sin(t)) with velocity 
r'{t) = (-2sin(£),2cos(£)): 

r , f 27r (-2cos(£),2sin(£)) , n . f 27r , 

jF dr = J y -j ^-(-2sin(t),2cos(t) = J 1 dt = 2tt . 



TRUE/FALSE QUESTIONS ON INTEGRAL THEOREMS. 

(TF) The flux of the curl of a vector field through the unit sphere is zero. 

(TF) The line integral of the curl of a vector field along a closed curve is zero. 

(TF) The line integral J c F ■ dr is independent of how a curve C : t ^ r(t) is parametrized. 

(TF) The maximal speed of a curve is independent on how the curve is parametrized. 

(TF) The flux integral f g F • dS through a surface is independent on how the surface S is parametrized. 

(TF) The area J s dS of a surface is independent on how the surface S is parametrized. 

(TF) The maximal value of r u x r v on a surface S is independent on how the surface is parametrized. 

(TF) There exists a vector field in space which has zero divergence, zero curl but is not a constant field. 

(TF) There exists a vector field in space which has zero gradient but is not a constant vector field 
F(x, y, z) = (a, 6, c). 

(TF) There exists a function in space which has zero Laplacian f xx + f yy + f zz = but which is not constant. 
(TF) div(grad(F)) = and div(curl(F)) = and curl(grad(F)) = 0. 

(TF) The line integral of a gradient field along any part of a level curve F = const is zero. 
(TF) If div(F) = 0, then the line integral along any closed curve is zero. 
(TF) If curl(F) = 0, then the line integral along any closed curve is zero. 
(TF) If div(-F) = then the flux integral along any sphere in space is zero. 
(TF) If curl(F) = then the flux integral along any sphere in space is zero. 





OBJECTS / : R n -> R m IN MULTI VARIABLE CALCULUS Math21a, 2004 



SCALAR FUNCTION (ID), (r. 
where each /(#) is a real numbe 
derivative is /'(x). 



= l,m = 1). A function of one variable, 
. The graph is a curve in the x, y plane. The 



\JV4\j 



SCALAR FUNCTION (2D), (n = 2,m = 1). A function f(x,y) defined in the 
plane is also called a scalar field. The graph of / is a curve in space (see figure). 
Level curves are curves in the plane. A derivative is Vf(x,y), the gradient. 



SCALAR FUNCTION (3D), (n = 3,m = 1). A function f(x,y,z) defined 
in space is also called a scalar field. Its graph would be an object in 4D. 
Level surfaces are surfaces in space (see figure). A derivative is the gradient 
Vf(x,y,z). 




VECTOR FIELD (2D), (n = 2,ra = 2) A function on the plane which 
attaches a vector (P(x,y),Q(x,y)) to each point (x,y). Derivatives are 
curl(F) = Q x — P y or div(F) = P x + Q y both scalar fields (real valued func- 
tions). Also coordinate changes T(u,v) = (x(u,v),y(u,v)) are the same 
mathematical object. 



VECTOR FIELD (3D), (n = 3,m = 3) At each point in space, we attach 
a vector F(x,y,z) = (P(x,y,z),Q(x,y,z),R(x,y,z)) at each point (x,y,z). 
Derivatives are the curl curl(.F), a vector field or the divergence div(i 7 ') = 
Px + Qy + Rz 5 a scalar field (real valued functions). Also coordinate changes 
T(u,v,w) = (x(u,v,w),y(u,v,w),z(u,v,w)) are the same mathematical ob- 
ject. 



CURVE (2D), (n = l,m = 2) For each t is defined a point r(t) = (x(t),y(t)) 
in the plane. A derivative is r'(t) = (x'(t), y'{t)), the velocity. 




CURVE (3D), (n = 1, m = 3) For each t, we have a point r(t) = (x(t), y(t), z{t)) 
in space. A derivative is r'{t) = (x' (t) , y' \t) , z' \t)) , the velocity. 



SURFACE (2D), (n = 2,m = 3) For each (u,v) there is a point r(u,v) = 
(x(u,v),y(u,v),z(u,v)) in space. The normal vector at a point r(u,v) is 
r u (u, v) x r v (u, v). 



GLOSSARY CHECKLIST Math 21a, Oliver Knill 



Geometry of Space 



coordinates and vectors in the plane and in space 

V = (vi,V2,V3),W = (wi,W2,Wz), V + W = (v\ + Wi,V 2 + ^2,^3 + U) 3 ) 

dot product v.w = ViWi + V2W2 + V3W3 = \v\\w\ cos(q;) 

cross product, v.(v x w) = 0, w.(v x w) = 0, \v x w\ = M|u>| sin(a) 

triple scalar product it ■ (v x w) volume of parallelepiped 

parallel vectors v x w = 0, orthogonal vectors v • w = 

scalar projection comp w (u) = v • w/\w\ 

vector projection proj^i?) = (v ■ w)w/\w\ 2 

completion of square: example x 2 — 4x + y 2 = 1 is equivalent to (x — 2) 2 + y 2 = —3 
distance d(P, Q) = \PQ\ = y/(Pi - Qi) 2 + (P 2 - Q2) 2 + (P3 - Q3) 2 



Lines, Planes, Functions 



symmetric equation of line = = z ~ z ° 

plane ax -\- by -\- cz = d 

parametric equation for line x = xq + tv 

parametric equation for plane x = xo + tv + sw 

switch from parametric to implicit descriptions for lines and planes 

domain and range of functions f(x, y) 

graph G = {(x, y, f(x, y))} 

intercepts: intersections of G with coordinate axes 
traces: intersections with coordinate planes 

generalized traces: intersections with {x = c}, {y = c} or {z = c} 

quadrics: ellipsoid, paraboloid, hyperboloids, cylinder, cone, hyperboloid paraboloid 

plane ax + by + cz = d has normal n = (a, b, c) 

line fo" 3 *) = y=Mn = ^0 contains v = (a, 6, c) 

sets y,z) = c describe surfaces, example graphs g(x, y,z) = z — f(x, y) 

linear equation like 2x + 3y + 5z = 7 defines plane 

quadratic equation like x 2 — 2y 2 + 3z 2 = 4 defines quadric surface 

distance point-plane: d(P, E) = |(PQ) • n|/|n| 

distance point-line: d(P,L) = |(-PQ) x 

distance line-line: d(L, M) = |(PQ) • (w x x v\ 

finding plane through three points P,Q,R: find first normal vector 



Curves 



plane and space curves f(t) 

velocity f{t), Acceleration r "(t) 

unit tangent vector f{t) = r'(t)/\r'(t)\ 

unit normal vector N{t) = f'(t)/\f'(t)\ 

f'(t) is tangent to the curve 

v = r ' then f = J* v dt + c 

f(t) = (/(£) cos(t), r(t) sin(t)) polar curve to polar graph r = f{9). 
r(t) = r (l + e)/(l + ecos(t)) ellipse. 
The three Kepler laws. 



I Surfaces | 

polar coordinates (x,y) = (r cos(6>), r sin(#)) 

cylindrical coordinates (x,y,z) = (r cos(0), r sin(0), 2;) 

spherical coordinates (x,y,z) = (pcos(#) sin(0), psin(#) sin(0), pcos(0)) 

g{r,9) = polar curve, especially r = /(#), polar graphs 

g(r, 0, z) = cylindrical surface, i.e. r = f(z, 9) or r = /(z) surface of revolution 

g(p, 0, (f)) = spherical surface especially p = f(9, 4>) 

f(x, y) = c level curves of /(a;, y) 

g(x, y,z) = c level surfaces of g(x, y, z) 

circle: x 2 + y 2 = r 2 , f (£) = (r cos t, r sin t) . 

ellipse: x 2 / a 2 + y 2 /b 2 = 1 , f(t) = (a cos t, b sin t) 

sphere: £ 2 + ?/ 2 + z 2 = r 2 , f(u,v) = (rcoswsinv,rsinwsin?;,rcosv) 

ellipsoid: a; 2 /a 2 + ?/ 2 /6 2 + z 2 /c 2 = 1, r(w, v) = (a cos w sin v, bs'mus'mv,ccosv) 

line: ax + by = d, f(t) = (t, d/b — ta/b) 

plane: ax + by + cz = d , f(w, = fb + + vw, (a, b, c) = v x w 
surface of revolution: r(9,z) = f(z), f(u,v) = (f(v) cos(w), f(v) sin(w), v) 
graph: g(x, y,z) = z — f(x, y) = 0, f(u, v) = (it, v, /(it, v)) 

I Partial Derivatives | 

f x (x,y) = ^f(x,y) partial derivative 

partial differential equation PDE: F(f, f x , ft, fxx, ftt) = 

ft = /m heat equation 

ftt — fxx = wave equation 

fx — ft = transport equation 

/xx + /yy = Laplace equation 

L(x,y) = f(x ,y ) + f x {x ,y )(x - x ) + f y {x ,y )(y - y ) linear approximation 

tangent line: L(x,y) = L(x ,y ), ax + by = d with a = f x (x ,y ),b = f y (x ,y ), d = 

ax + by 

tangent plane: L(x, y, z) = L{xq, yo, zq) 
estimate f(x, y, z) by L(x, y, z) near (xo, yo, zq) 

\f(x, y) — L(x, y)\ in box R around (xo, yo) is < M(\x — xo\ + \y — 2/0 1 ) 2 / 2, where M is the 
maximal value of \f xx (x,y)\, \f X y(x,y)\, \f yy (x,y)\ in R. 
f(x,y) called differentiable if f x ,f y are continuous 
fxy = fyx Clairot's theorem 
f u (u,v), r v tangent to surface f(u,v) 

I Gradient | 

Vf(x,y) = (f x J y ), Vf(x,y,z) = {f x J y J z ), gradient 
Dvf = Vi/ • v directional derivative 
if(f(t)) = V/(r(t)) • f'(t) chain rule 

Vf(xo,yo, zq) is orthogonal to the level surface f(x, y,z) = c which contains (xo, yo, zq). 
j- t f{x + tv) = D v f by chain rule 

/x(x "yo%o) = f v {lo V ylz ) = Mxo,ylzo) n ° rmal Hne t0 SUrfaCe K X > V ' ^ = ° &t ( X °> *°) 

(x - a? )/x (^o,2/o,^o) + (2/ - yo)f y {xo,yo,zo) + {z - z )f z {x ,y ,zo) = tangent plane at 
(xo,yo,z ) 

directional derivative is maximal in the v = V/ direction 
f{x,y) increases, if we walk on the xy-pl&ne in the V/ direction 
partial derivatives are special directional derivatives 
if D v f(x) = for all v, then V/(f ) = 

implicit differentiation: f(x,y(x)) = 0, f x l + f y y'(x) = gives y'(x) = -f x /f y 



Extrema 



Vf(x,y) = (0,0), critical point or stationary point 
D = fxxfyy — fxy discriminant or Hessian determinant 
f{xQ 1 yo) > f{x,y) in a neighborhood of (xo,yo) local maximum 
f(xo,yo) < f{x,y) in a neighborhood of (xo,yo) local minimum 
X7f(x,y) = Wg(x,y), g(x,y) = c, A Lagrange multiplier 
two constraints: Vg = or V/ = XV g + pVh, g = c,h = d 

second derivative test: V/ = (0, 0), D > 0, f xx < local max, V/ = (0, 0), D > 0, f xx > 
local min, V/ = (0,0), D < saddle 



Double Integrals 

I fiif( x iy) dA double integral 

fa fc f( x ' dydx integral over rectangle 

fa 5gl{x) f& V) d V dx ^P e 1 re S ion 

fc Shlly) f( x > y) dxd y t yp e 11 re s ion 

/ J!r f( r ' drd ® P°l ar coordinates 
f f R \r u x r v \ dudv surface area 

fa fc f( x > y) d y dx = fc fa f( x > y) dxd y Fubini 

J J R 1 dxdy area of region R 

f fiif( x iy) dxdy volume of solid bounded by graph(f) xy-plane 



Triple Integrals 



f f fiif( x iy-> z ) d V triple integral 
fa fc fu f( x i y-> z ^ d y dx integral over rectangular box 
/„" C ( Z } /(*. V) dzdydx type I region 
/(r, 9, z) \F\ dzdrd9 cylindrical coordinates 



/ / /r/C/ 9 '^'^) P 2 sm (0) dzdrd9 spherical coordinates 

fa fc fu f( x ^^ z ) dzdydx = ^ f*f(x,y,z) dxdydz Fubini 
V = J J J R 1 dV volume of solid R 
M = J J J R p(x, y, z) dV mass of solid R with density p 
(JJJ R x dV/V : JJJ R y dV/V, JJJ R z dV/V) center of mass 



Line Integrals 



F(x, y) = (P(x, y), Q(x, y)) vector field in the plane 

F(x, y, z) = (P(x, y, z), Q(x, y, z),R(x, y, z)) vector field in space 

J c F • dr = F(r(t)) • r'(t) dt line integral 

Use also notation j c F • Tds and J Mdx + Ndy + Pdz. 

F(x,y) = Vf(x,y) gradient field = potential field = conservative 



Fundamental theorem of line integrals 



FTL: F(x, y) = V/(z, y), f a F{r{t)) ■ r'(t)) dt = f(r(b)) - f(r(a)) 
Closed loop property J c F dr = 0, for all closed curves C 

Always equivalent are: closed loop property, conservativeness and gradient field 
Mixed derivative test curl(F) ^ assures F is not a gradient field. 
In simply connected domains, curl(F) = implies conservativeness. 



Green's Theorem | 

F(x, y) = (P, Q), curl(F) = Q x - P y = V x F. 

Green's theorem: C boundary of R, then J c F ■ dr = J J R curl(F) dxdy 

Area computation: Take F with curl(F) = N x - M y = 1 like F = (-y,0) or F = (0,x) or 

F = (-y,x)/2. 

Greens theorem is useful to compute difficult line integrals or difficult 2D integrals. 



| Flux integrals | 

F(x, y, z) vector field, S = r{R) parametrized surface 
r u x r v normal vector, n = ^*^| unit normal vector 

r u x r v dudv = dS = ft dS normal surface element 

/ f s F dS = / f s F(r(u,v)) ■ (r u x r v ) dudv flux integral 

Use also notation J J s F • nda 

| Stokes Theorem | 

F(x, y, z) = (P, Q, R), curl(P, Q, R) = {R y - Q x , P z - R Xl Q x - P y ) = V x F 
Stokes's theorem: C boundary of surface S, then J c F ■ dr = J J s curl(P) • dS 
Stokes theorem is useful to compute difficult flux integrals of curl(P) or difficult line inte- 
grals. 

| Div Grad Curl | 

V = (d x , d y , d z ), grad(P) = V/,curl(P) = V x P, div(P) = V • P 
div(curl(P)) = 
curl(grad(P)) = 
div(grad(/)) = A/. 



Divergence Theorem j 

div(P,Q,i?) = P X + Q y + R z = V-P 

Divergence theorem: E bounded by S then J J s F ■ dS = J J f E div(P) dV 

The divergence theorem is useful to compute difficult flux integrals or difficult 3D integrals. 



Some topology 

Simply connected region D: can deform any closed curve within D to a point on curve. 
Interior of a region D: points in D for which small neighborhood is still in D. 
Boundary of a curve: the end points of the curve if they exist. 

Boundary of a surface S are curves which bound the surface, points in the surface which 
correspond to parameters (w, v) which are not in the interior of the parametrization domain. 
Boundary of a solid D: the surfaces which bound the solid, points in the solid which are not 
in the interior of D. 

Closed surface: a surface without boundary like for example the sphere. 
Closed curve: a curve with no boundary like for example a knot. 



Some surface parametrizations 

Sphere of radius p: r(u,v) = (pcos(w) sin(v), psin(w) sin(v), pcos(v)) 
Graph of function f(x, y): f(u, v) = (it, v, f(u, v)) 

Graph of function f(4>,r) in polar: f(u,v) = (v cos (it), v sin (it), f(u, v)) 
Plane containing P and vectors it, v: r(it, i?) = P + uu + vv 

Surface of revolution: distance g(z) of z — axes : r(u,v) = (g(v) cos (it), g(v) sin(it), v) 
Cylinder: r(u,v) = (r cos(it), r sin(it), v). 



11/21/2001, LAB: 3D-SCANNING 



Math 21a, O. Knill 



COMMERCIAL SCANNERS. 
There are scanners on the market 
which can scan 60 '000 points per 
second. The model on the photo 
to the right costs currently 410,000 
dollars (www.cyberware.com). 



THE PROBLEM. Given a surface in space. Take 
a cup for example. How do we model the surface 
mathematically? Taking pictures of the cup only 
gives us two dimensional representations. Recon- 
structing the surface from two dimensional pro- 
jections is called 3D scanning. 

SOLUTION. One method is to make two or more projections. Each point in the photo determines a 
line in space. If we have at least two photos which contain both points, the point is fixed: the intersec- 
tion of these lines is the point we are looking for. The next problem is then construct a surface from these points. 

To see how this works in detail, we split up the problem into three subproblems. 



PROBLEM 1) Given the camera position C = (a, 6, c) and the center P = (x,y,z) of the object as well 
as a point p = (w, v) on the film: find a line line r i— > C+rU which passes through the point P, where U = C — O. 

SOLUTION. UU = C-0= (wi,w 2 ,0) and TV = (0,0,1). A unit vector orthogonal to u and N is V = UxN = 
(-w 2 ,wi,0)/vS + ^)- Th en P = C + Vu + Nv. 



PROBLEM 2) Given the location C x = (x u y u zx), 
C<i = (x2,V2,Z2) of the two cameras and the location 
O = (0, 0, 0) of the center of the object. Given a point 
(ui,vi) on the first film and a point (^2,^2) on the second 
film. Find the point P on the surface. 

SOLUTION, p = (w, v) on the first film determines by 
PROBLEM 1) a line P + rU . Similarly, a point q on 
the second film determins a line Q + sV in space. The- 
oretically, these two lines meet in a point X = (x,y,z). 
However, due to errors, these two lines will miss in gen- 
eral. We assign the best possible choice, the point closest 
to both points. Because we know 113 = u\ x w 2 the di- 
rection of the closest connection between the two lines, we 
have to solve C\ + su\ + tu% = C 2 + rw 2 to get the point 
X = C\ + su\ + (t/ 2)113. This is a system of three equa- 
tions for the three unknowns £i,£ 2 ,t3. We can solve this 
system (in Lagrange multiplier problems, we solved more 
complicated nonlinear systems of equations). 



FAMILIAR EXAMPLE. If the camera C\ is on the x axes and takes a shot of the y — z plane, then the point 
(x, y, z) will be the point (y, z) in the photo. Similarly, if C 2 is on the y axes, then a point (x, y, z) will be the 
point (x, z) on the photo. Ideally, if the point is from both sides, then (x, y, z) is determined. 






PROBLEM 3) Given a set of points P» 
Model the surface as a set of triangles. 



: (xi,yi, 



) on a surface. 



SOLUTION. The solution to this problem is provided by an 
algorithm called Delaunay triangularization. It produces a 
desired set of triangles. 

Fortunately, Mathematica has already built in a func- 
tion "TriangularSurfacePlot" which is contained in the Dis- 
creteMath'ComputationalGeometry' module. The figure to the 
right shows a triangularsation applied to a random surface. 




STEP 1. DATA INPUT (C PROGRAM). A GUI xwindows 
C program 3dscan.c helps us to enter the data with a mouse. 
The two pictures are scaled, concatenated to a single picture 
and displayed. The program 3dscan.c allows to enter pairs 
of points with the mouse. The data are written onto a text 
file "result.dat" . 



STEP 2. DATAPROCESSING (MATHEMATICA). Given the object O = (0,0,0), the camera positions 
C\ = (Ci x ,Ciy,Ci z ),C2 = (C 2x , C*2y, C2z) and the points (ui,vi) on the first film and (1*2,^2) on the second 
film. The formula for the point P(x, y, z) are obtained from solving A\ + 1\ * C\ + £3 * TV = A2 + 1 2 * C 2 , where 
Ai = d + Ui(d -0)x N/\d -0\ + ViN. 

A mathematica program "stereo3d.m" takes the data in "result.dat" and writes a new file "3d.dat" which 
contains the points Pi in space. 



Also the triangularisation is done with Mathematica. The program is called " triangular. m" . It takes as in input 
the file "3d.dat" and outputs a Povray file " surface. inc" which is essentially a set of triangles written in form, 
Povray can read. 



STEP 3. RENDERING (POVRAY). Finally, with all the triangles known, we run the data through a ray 
tracing program " surface. pov" . This Povray program takes the file "result. inc" as an input and produces a 
picture " surface. ppm" . 



POVRAY. Povray 



is a free raytracmg 


camera { up y right x location <S) r l r -~5 


> look.at <0,-l,3> } 


program. It allows 


light.source { <0,300, -100> colour 


rgb <144> } 


to render three di- ^^^^ 


ttbackground { rgb <i,l,i> } 




mensional objects. ^fl 


ttdeclare r = texture { 




To the right, you 


pigment { rgb <i,0,0> } 




sec a simple Povray 


finish { phong l + ambient + 5 diffu 


se t 8} 


program. It simply ^^^^ 


}■ 




shows a red sphere 


sphere {<0,0,0>,1 texture{ r } } 




on a white back- 




9,0-1 Top 



ground. 



THE LANGUAGE POVRAY. 



< a,b,c> 


point or vector (a,b,c) 


spherejP, r} 


sphere centered of radius r centered at (a,b,c) 


box{P, Q} 


rectangular box with diagonal corners P,Q 


cylinder {P, Q,r] 


cylinder of radius r over segment P Q 


plane{7V, d] 


plane with normal vector TV, in distance d from origin 


triangle { P,Q,R } 


triangle with endpoints P,Q,R 



One can take intersections, 



unions, differences of objects. Furthermore, every object can be scaled, rotated and translated. 
In the example above, you see how to apply textures. Colors are represented as vectors < r,g,b >, where r is 
"red" and g is "green" and b is "blue". The addition of colors is "additive". For example < 1, 1,0 > is yellow. 
The best way to learn the language is to look at examples. There are thousands of examples available on the 
web. The official distribution of Povray includes many examples and contains a nice manual. 
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INTRODUCTION. Topics beyond multi- variable calculus are usually labeled with special names like "linear 
algebra", "ordinary differential equations", "numerical analysis", "partial differential equations", "functional 
analysis" or "complex analysis". Where one would draw the line between calculus and non-calculus topics is 
not clear but if calculus is about learning the basics of limits, differentiation, integration and summation, then 
multi- variable calculus is the "black belt" of calculus. Are there other ways to play this sport? 



HOW WOULD ALIENS COMPUTE? On an other planet, calculus might be taught in a completely different 
way. The lack of fresh ideas in the current textbook offerings, where all books are essentially clones (*) of each 
other and where innovation is faked by ejecting new editions every year (which of course has the main purpose 
to prevent the retail of second hand books), one could think that in the rest of the universe, multi- variable 
calculus would have to be tought in the same way, and where chapter 12 is always the chapter about multiple 
integrals. Actually, as we want to show here, even the human species has come up with a wealth of different 
ways to deal with calculus. It is very likely that calculus textbooks would look very different in other parts of 
our galaxy. This week broke the news that one has discovered a 5'th arm of the milky way galaxy. It is 77'000 
light years long and should increase the chance that there are other textbooks in our home galaxy. In this text, 
we want to give an idea that the calculus topics in this course could be extended or built completely differenty. 
Actually, even numbers can be defined differenty. John Conway introduced once numbers as pairs {L\R} where 
L and R are sets of numbers defined previously. For example {0|0} = and {O|0} = 1, {0|O} = —1, {0|1}. 
The advantage of this construction is that it allows to see "numbers" as part of "games". Donald E. Knuth, 
(the giant of a computer scientist, who also designd "TeX", a typesetting system in which this text is written), 
wrote a book called "surreal numbers" in which two students find themselves on an iland. They find a stone 
with the axioms for a new number system is written and develop from that an entirely new number system 
which contains the real line and more. The book is a unique case, where mathematical discovery is described 
as a novel. This is so totally different from what we know traditionally about numbers that one could expect 
Conway to be an alian himself if there were not many other proofs of his ingenious creativity. 
(*) One of the few exceptions is maybe the book of Marsden and Tromba, which is original, precise and well written. 
It is unfortunately slightly too mathematical for most calculus consumers and suffers from the same disease that other 
textbooks have: a scandalous prize. A definite counterexample is the book "how to ace calculus" which is funny, 
original and contains the essential stuff. And it comes as a paperback. Together with a Schaum outline volume (also 
in paperback), it would suffice as a rudimantary textbook combination (and would cost together half and weight one 
fourth) of the standard doorstoppers. 

NONSTANDARD CALCULUS. At the time of Leon ard Euler, people thought about calculus in a more intuitive, 
but less formal way. For example | (1 + x/n) n = e x ~[ with infinitely large n would be perfectly fine. A modern 
approach which catches this spirit is "nonstandard analysis", where the notion of "infinitesimal" is given a 
precise meaning. The simplest approach is to extend the language and introduce infinitesimals as objects 
which are smaller than all standard objects. We say x ~ y if \x — y\ is infinitesimal. All numbers are 
traditionally defined like tt or y/2 are " standard" . The notion which tells that every bounded sequence has an 
accumulation point is expressed by the fact that there exits a finite set A such that all x E D are infinitesimally 
close to an element in A. The fact that a continuous function on a compact set takes its maximum is seen by 
taking M = max ae ^/(a). 

What impressed me as an undergraduate student learning nonstandard calculus (in a special course completely 
devoted to that subject) was the elegance of the language as well as the compactness in which the entire calculus 
story could be packed. For example, to express that a function / is continuous, one would say x ~ y then 
f(x) ~ f{y)- This is more intuitive than the Weierstrass definition Ve > 035 > 0\x — y\ < 5 \f{x) — f(y)\ < e 
which is understood today primarily by intimidation. To illustrate this, Ed Nelson, the founder of one of 
the nonstandard analysis flavors, asks the meaning of | V<5 > 03e > 0\x — y\ < 5 =>- \ f(x) — f(y)\ < e | in order to 
demonstrate how unintuitive this definition really is. The derivative of a function is defined as the standard 
part of f'{x) = (f(x + dx) — f(x))/dx, where dx is infinitesimal. Differentiability means that this expression 
is finite and independent of the infinitesimal dx chosen. Integration f(y) dy is defined as the standard part 
°f J2 Xj e[a,x] f( x i)dx where Xk = kdx and dx is an infinitesimal. The fundamental theorem of calculus is the 
triviality F(x + dx) - F(x)) = f(x)dx. 

The reasons that prevented nonstandard calculus to go mainstream were lack of marketing, bad luck, being 
below a certain critical mass and the initial believe that students would have to know some of the foundations 
of mathematics to justify the game. It is also quite a sharp knife and its easy to cut the finger too and doing 
mistakes. The name "nonstandard calculus" certainly was not fortunate too. People call it now "infinitesimal 
calculus" . Introducing the subject using names like "hyper-reals" and using "ultra-filters" certainly did not help 
to promote the ideas (we usually also dont teach calculus by introducing Dedekind cuts or completenes) but 
there are books like by Alain Robert which show that it is possible to teach nonstandard calculus in a natural 
way. 



DISCRETE SPACE CALCULUS. Many ideas in calculus make sense in a discrete setup, where space is a graph, 
curves are curves in the graph and surfaces are collections of " plaquettes" , polygons formed by edges of the 
graph. One can look at functions on this graph. Scalar functions are functions defined on the vertices of the 
graphs. Vector fields are functions defined on the edges, other vector fields are defined as functions defined on 
plaquettes. The gradient is a function defined on an edge as the difference between the values of / at the end 
points. 

Consider a network modeled by a planar graph which forms triangles. A scalar function assigns a value f n 
to each node n. An area function assigns values fa to each triangle T. A vector field assign values F nm to 
each edge connecting node n with node m. The gradient of a scalar function is the vector field F nm = f n — f m . 
The curl of a vector field F is attaches to each triangle (k,m,n) the value curl(F)fc mn = Fk m + F mn + F n k- 
It is a measure for the circulation of the field around a triangle. A curve 7 in our discrete world is a set of 
points rj,j = 1, . . . , n such that nodes rj and 77+1 are adjacent. For a vector field F and a curve 7, the line 
integral is Y^j=i Fr(j)r j+1 - A region R in the plane is a collection of triangles T. The double integral of 
an area function Jt is ^2 TeR fT- The boundary of a region is the set of edges which are only shared by one 
triangle. The orientation of 7 is as usual. Greens theorem is now almost trivial. Summing up the curl over a 
region is the line integral along the boundary. 

One can push the discretisation further by assuming that the functions take values in a finite set. The integral 
theorems still work in that case too. 



QUANTUM MULTIVARIABLE CALCULUS. Quantum calculus is "calculus without taking limits". There are 
indications that space and time look different at the microscopic small, the Planck scale of the order h. One of 
the ideas to deal with this situation is to introduce quantum calculus which comes in different types. We discuss 
q-calculus where the derivative is defined as D q f(x) = d q f(x)/d q (x) with d q f(x) = f(qx) — f(x) . You can 



see that D q x n = [n]x n 1 , where [n] = q q _^ . As q — > 1 which corresponds to the deformation of quantum 
mechanics with Ti — > to classical mechanics, we have [n] — > n. There are quantum versions for differentiation 
rules like D q (fg)(x) = D q f(x)g(x) + f(qx)D q g(x) but quantum calculus is more friendly to students because 
there is no simple chain rule. 

Once, we can differentiate, we can take anti-derivatives. It is denoted by J f(x) d q (x). As we know the derivative 
of x n , we have J x n dq{x) = ^ n a n x n+1 /[n+ 1] + C with a constant C. The anti-derivative of a general function 

is a series J f(x) d q (x) = (1 — q)x^^ q j f(q j x) For functions f(x) < M/x a with < a < 1, the integral 



is defined. With an anti-derivative, there are definite integrals. The fundamental theorem of q-calculus 

f(x)d q (x) = F(b) — F(a) where F is the ant i- derivative of / still holds. Many results generalize to q-calculus 
like the Taylor theorem. A book of Kac and Cheung is a beautiful read about that. 

Multivariable calculus and differential equations can be developed too. A handicap is the lack of a chain rule. 
For example, to define a line integral, we would have to define J F d q r in such a way that J V/ dr q = f(r(b)) — 
f(r(a)). In quantum calculus, the naive definition of the length of a curve depends on the parameterization of 
the curve. Surprises with quantum differential equations QODE d q f = / which is f(qx) — f(x) = (q-l)f(x) 
simplifies to f(qx) = qf{x). A classic differentiation gives f'{qx) = f'(x) showing that the QODE has many 
solutions f(x) = Cx + g(\og(x)/ log(g)), where g{x) = g(x + 1) is periodic. 



INFINITE DIMENSIONAL CALCULUS. Calculus in infinite dimensions is called functional analysis. Func- 
tions on infinite dimensional spaces which are also called functionals for which the gradient can be defined. The 
later is the analogue of D u f. One does not always have D u f = Vf • u. An example of an infinite dimensional 
space is the set X of all continuous functions on the unit interval. An example of a two dimensional surface in 
that space would be r(w, v) = (cos(w) s'm(v)x 2 + sin(u) sin(v) cos(;r) + cos(v)/(l + x 2 ). This surface is actually 
a two dimensional sphere. On this space X one can define a dot product f • g — J f(x)g(x) dx. 
The theory which deals with the problem of extremizing functionals in infinite dimensions is called calculus of 
variations. There are problems in this field, which actually can be answered within the realm of multivariable 
calculus. For example: a classical problem is to find among all closed regions with boundary of length 1 the one 
which has maximal area. The solution is the circle. One of the proofs which was found more than 100 years ago 
uses Greens theorem. One can also look at the problem to find the polygon with n edges and length 1 which 
has maximal area. This is a Lagrange extremization problem with regular polygons as solutions. In the limit 
when the number of points go to infinity, one obtains the isoperimetric inequality. Other problems in the 
calculus of variations are the search for the shortest path between two points in a hilly region. This shortest 
path is called a geodesic. Also here, one can find approximate solutions by considering polygons and solving 
an extremization problem but direct methods in that theory are better. 



GENERALIZED CALCULUS. In physics, one wants to deal with objects which are more general than functions. 
For example, the vector field F{x 1 y) = {—y 1 x)/{x 2 + y 2 ) has its curl concentrated on the origin (0,0). This is 
an example of an object which is a distribution. An example of such a Schwartz distribution is a "function" 
/ which is infinite at 0, zero everywhere else, but which has the property that J fd dx = 1. It is called 
the Dirac delta function. Mathematically, one defines distributions as a linear map on a space of smooth 
"test functions" which decay fast at infinity. One writes (/, 4>) for this. For continuous functions one has 
(f,4>) = f(x)4>(x) dx, for the Dirac distribution one has (/, 4>) = 0(0). One would define the derivative 

{0 x < 
1 x > ° ne ^ aS 

(H,(f)) = f^&ix) dx and because {H.cf)') = f^^ftix) dx = — 4>'{0) one has {H',(j)) = 0(0) the Dirac delta 
function. The Dirac delta function is still what one calls a measure, an object which appears also in probability 
theory. However, if we differentiate the Dirac delta function, we obtain (D',0) = —(D,(f)') = — 4>'(0). This is 
an object which can no more be seen as a probability distribution and is a new truly "generalized" function. 

COMPLEX CALCULUS Calculus in the complex is called complex analysis. Many things which are a bit 
mysterious in the real become more transparent when considered in the complex. For example, complex analysis 
helps to solve some integrals, it allows to solve some problems in the real plane better. Here are some places 
where complex analysis could have helped us: to find harmonic functions, one can take a nice function in the 
complex like z 4 = (x + iy) 4 = x 4 — x 2 y 2 + y 4 + i(x 3 y + xy 3 ) and look at its real and complex part. These 
are harmonic functions. Differentiation in the complex is defined as in the real but since the complex plane is 
two dimensional, one asks more: f(z + dz) — f(z)/dz has to exist and be equal for every dz — > 0. One writes 
d z = {d x — id y )/2 and complex differentiation satisfies all the known properties from differentiation on the 
real line. For example d/dzz n = nz n ~ 1 . Multivariable calculus very much helps also to integrate functions 
in the complex. Again, because the complex plane is two dimensional, one can integrate along paths and the 
complex integral J c f(z) dz is actually a line integral. If z(t) = x(t) + iy(t) is the path and f = u + iv then 

Greens theorem for example is the easiest way to derive 



J c f(z) dz = Ja(ux' — vy')dt + i J^(uy' + vx') dt 



the Cauchy integral theorem which says that J c f(z) dz = if C is the boundary of a region in which / is 
differentiable. Integration in the complex is useful for example to compute definite integrals. 
One can ask, whether calculus can also be done in other number systems besides the reals and the complex 
numbers. The answer is yes, one can do calculus using quaternions, octonions or over finite fields but each of 
them has its own difficulty Quaternion multiplication already does no more commute AB ^ BA and octonions 
multiplication is even no more associative (AB)C 7^ A(BC). In order to do calculus in finite fields, differentiation 
will have to be replaced by differences. A Finnish mathematician Kustaanheimo promoted around 1950 a finite 
geometrical approach with the aim to do real physics using a very large prime number. These ideas were later 
mainly picked up by philosophers. It is actually a matter of fact that the natural numbers are the most 
complex and the complex numbers are the most natural. 

CALCULUS IN HIGHER DIMENSIONS. When extending calculus to higher dimensions, the 
concept of vectorfields, functions and differentiations grad,curl,div are reorganized by introduc- 
ing differential forms. For an integer < k < n, define k-forms as objects of the form 



a = J2i a idxi = ^ il<i2< <ik ai x ^.^i k dxi x A . . . A dxi k , where dxi A dxj is a multiplication called exte- 
rior product which is anti-commutative: a A (3 = —(3 A a. Then, one defines an exterior derivative 
as d^jdidxi = dai A dxj In three dimensions, where we write dx = dx±,dy = dxi and dz = dx% 



k = 


form a = 


da = 





f 


fxdx + fydy + f z dz 


1 


Mdx + Ndy + Pdz 


(N x - M y )dx Ady+ (P x - M z )dx Adz+ (P y - N z )dy A dz 


2 


Adx\ A dx2 + Bdx\ A dx% + Cdx2 A dxs 


(A z -B y + C x )dx Ady Adz 


3 


gdxi A dx2 A dx% 


0. 



(The computation of the curl was N x dx A dy + P x dx Adz -\- M y dy A dx + P y dy A dz + M z dz A dx + N z dz Ady = 
(N x — M y )dx Ady + (P x — M z )dx A dz + (P y - N z )dy A dz. ) After an identification of with 3 forms and 
1 with 2 forms (called Hodge *operation) one can see the exterior derivatives as gradient, curl and divergence: 



It is useful also to clarify the notion of a surface by defining manifolds. Many definitions which we have 
seen for curves or surfaces can be extended to manifolds. The dot product can be defined more generally on 
manifolds and leads to a theory called Riemannian geometry used in the theory of general relativity. When 
learning relativity, one deals with 4-dimensional manifolds which incoorporate both space and time. There, the 
language of differential forms is already mandatory. The exterior derivative of a 1 form is a 2 form which has 
6 component. An example of such a 2-form is the electromagnetic field F = dA combining 3 electric and 3 
magnetic components. The Maxwell equations are dF = 0, d*F = /, where d* = *d* is defined using the 
Hodge * operation. The consequence d*dA = I becomes the wave equation d*dA = in the absense of / a 
vector incoorporating both electric charge and current i. 



FRACTAL CALCULUS. We have dealt in this courses with 0-dimensional objects (points), 1-dimensional 
objects (curves), 2-dimensional objects (surfaces) as well as 3-dimensional objects (solids). Since more than 100 
years, mathematicians also studied fractals, objects with noninteger dimension are called fractals. An exampls 
is the Koch snowflake which is obtained as a limit by repeated stellation of an equilateral triangle of initial 
arclength A = 3. After one stellation, the length has increased by a factor 4/3. After n steps, the length of the 
curve is A(4/3) n . The dimension is log(4/3) > 1. There are things which do no more work here. For example, 
the length of this curve is infinite. Also the curve has no defined velocity at all places. One can ask whether 
one could apply Greens theorem still in this case. In some sense, this is possible. After every finite step of this 
construction one can compute the line integral along the curve and Greens theorem tells that this is a double 
integral of curl(F) over the region enclosed by the curve. Since for larger and large n, less and less region is 
added to the curve, the integral J J curl(F) dxdy is defined in the limit. So, one can define the line integral 
along the curve. Closely related to fractal theory is geometric measure theory, which is a generalization 
of differential geometry to surfaces which are no more smooth. Merging in ideas from generalized functions 
and differential forms, one defines currents, functionals on smooth differential forms. The theory is useful for 
studiing minimal surfaces. Fractals appear naturally in differential equations as attractors. The most infamous 
fractal is probably the Mandelbrot set, which is defined as the set of complex numbers c for which the iteration 
of the map f(z) = z 2 + c starting with leads to a bounded sequence — > c — > c 2 + c — > (c 2 + c) 2 + c... The 
boundary of the Mandelbrot set is actually not a fractal. It is so wiggly that its demension is actually 2. One 
can modify the Koch snowflake to get dimension 2 too. If one adds new triangles each time so that the length 
of the new curve is doubled each time, then the dimension of the Koch curve is 2 too. 



THE FOUNDATIONS OF CALCULUS. Would it be possible that aliens somewhere else would build up 
mathematics radically different, by starting with a different axiom system? It is likely. The reason is that 
already we know that there is not a single way to build up mathematical truth. We have some choice: it 
came as a shock around the middle of the last century that for any strong enough mathematical theory, one 
can find statements which are not provable within that system and which one can either accept as a new 
axiom or accept the negation as a new axiom. Even some of the respected axioms are already independent 
of more elementary ones. One of them is the axiom of choice which says that for any collection C of 
nonempty set, one can choose from each set an element and form a new set. A consequence of this axiom 
which is close to calculus is a "compactness property" for functions on the interval. If we take the distance 
d{f,g) = max(/(x) — g{x)) between two functions and take a sequence of differentiable functions satisfying 
\fn{x)\ < M and \f' n (x)\ < M then there exists a subsequence f nk which converges to a continuous function. 

A rather counterintuitive consequence of the Axiom of choice is that one can decompose the unit ball into 
5 pieces Ei, move those pieces using rotations and translations and reassemble them to form two copies of 
the unit ball. We can not integrate J J J E dV over this regions: the sum of their volumes would be either 
47r/3 or 87r/3 depending on whether we integrate before or after the arrangement. This is such a paradoxical 
construction that one calls it a paradox: the Banach-Taski paradox. 

An other axiom for which is not clear, whether it matters in calculus is the continuum hypothesis which tells that 
there is no cardinality between the "infinity" of the natural numbers Ko and the "infinity" of the real numbers ot\. 

It had been realized in the 19'th century by Cantor 

that these two infinities are different. Cantors ar- ^ [214231423412341234134 
gument is to assume that one could enumerate all 
numbers between and 1: then look at the diagonal 
number, in whic h each digit is altered. For example: 
I x = 0.35285.... | . This real number x disagrees with 

all the numbers in the list and was therefore not ac- 5 0.36200^747389238934211147. 
counted for in the counting. 

The possibilities to question or alter the mathematical builup does not stop on the set theoretical level. People 
even tried to change logic. Among things which have been proposed are fuzzy logic, where truth can take 
a value between and 1 or quantum logic or a logic in which one has three possibilites, true, not true or 
" undecided" . 

One not only has the possibility to extend mathematics differently. A rather rich playground can be covered 
by restricting tools. One can for example ask, what part of calculus can still be done in without the axiom 
of choice. One can also ask that one only is allowed to talk about objects which can be constructed explicitely. 
There were people, strict finitists, who went even further and suggested to disqualify too large numbers like 
10 10 . Calculus teachers have even to go further and produce problems so that the answer is 1 or or tt or 
1/3. Problems in final exams, where the answer is 1234/12 is unthinkable. The philosophical direction teachers 
are forced to follow is called ultra strict finitism... 



2 0.304223498273413904173... 

3 0.6904341074147346738874... 

4 0.999^4283382464200104131... 



POVRAY 



Math 21a, O. Knill 



A SCENE. The y axes is up. We place the 
object (the intersection of three cylinders) at 
the origin and the camera slightly above on 
the z axes. 




RUNNING POVRAY. If the povray file is called test.pov, 
run povray as "povray good.ini +i test.pov". The .ini file 
contains instructions like size of the picture, quality of ren- 
dering etc. With "povray anim.ini +i test.pov", a sequence 
of pictures is rendered. The .ini file contains now instruc- 
tions how many frames the movie should have. 




WHAT IS RAYTRACING? The objects, the camera and lights are in place. If photons leave the light source, 
they will reflect at the objects, change color and intensity and some of them will reach the camera. Adding up 
the light intensities of all these photons gives the picture. Raytracers work more efficient: instead of shooting 
photons from the light source and wasting most of the photos because they don't reach the film, it is better to 
start the light ray at the film and compute backwards. 



COLORS. Every color is given by a 
vector < r, g,b > in the color cube 
[0,1] x [0,1] x [0,1]. Color vectors 
can be added like usual vectors. 



^ # m 



PROGRAMMING. Povray is a 
programming language. Macros 
are procedures which can be 
reused and called within the the 
macro. This allows recursion like 
in this rendering of the 3D frac- 
tal to the right. You see also how 
one can play with colors. Identi- 
fying them with vectors in space 
colors the pyramid according to 
the color cube. 





OBJECTS. Many objects like 
quadrics ax 2 + bxy + cy 2 + dx + 
ey + fz + g = are predefined 
in Povray. The most basic ob- 
jects are boxes, spheres, cylin- 
ders, lathes (extruded curves). 
By joining, intersecting or taking 
differences of such objects, one 
can build more complicated ob- 
jects. Additionally, every object 
can be rotated and scaled. 





TEXTURES. Textures to plan- 
ets of the Solar system are 
available on the web. The 
texture is mapped onto the 
sphere using the map X(u,v) = 
(cos(u) sin(i>), sin(w) sin(v), cos(v) 
In the source code, the units are 
chosen so that l=lkm. Having 
natural units allows to model 
things more easily. In the scene 
an artificial light has been added 
from the camera in order to see 
the rings better. 



mils 

la sm^B^--^ 74500 

^»H sy ' ,w 92000 
98380 

^3i^***~~ 140210 



WATER AND SKY. Water and 
sky are frequenly added, often by 
"artistic" reasons. 



SMS ! rK: :IZ £s $ '<l±ll ! 





REFRACTION. When light 
passes from one medium to an 
other, the path of the ray of light 
is bent because the speed in dif- 
ferent media is different and the 
path choses the path traversed 
in shortest time. This is called 
refraction. For example air and 
water have different densities 
and thus refract differently. 




The index of refraction ior value 
is used to describe the relative den- 
sity of substances. The default ior 
value 1.0 (air) gives no refraction. 
Water has ior 1.33, glass is 1.5 and 
diamond 2.4. In the picture to the 
right, the camera is under water, the 
light source above the water. You 
see caustics like at the bottom of a 
swimming pool. 




EXTERIOR PRO- 
GRAMS. Many organic 
or an-organic molecules 
are available on the web 
in .pdb format. There 
are viewers which allow 
to see the molecule and 
translators, which export 
a povray file, which can 
then be rendered. To the 
right, you see the caffeine 
molecule. 
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ANIMATION. When doing animations, the camera moves along a curve. The velocity and acceleration vector 
can help to determine, in which direction the camera should look. The frames below were made with a bridge 
built after the Rheinfall bridge in Switzerland. 




RAY TRACING COMPETITION. There is a annual ray tracing comptetion going on. Here is an example of 
an entry from the 1999 competition The authors of the scene are Ian and Ethel MacKay. The source file for 
this scene is a single povray file. 



RENDERING. Raytracing is a CPU time intensive task. The software has to compute the light ray paths 
bouncing around in a virtual world, compute reflections or refractions. Tracing an image can take from a few 
seconds to days. A single frame in movies like "Toy Story" takes several hours to render. To get the large 
number of frames needed for a movie companies like "Pixar" (recently again visible with Monsters inc.) use 
" computer farms" a huge number of workstations now mostly running linux. 



THE ROLE OF MATH 21a. Many topics you learned appear in the area of ray tracing. For example, to 
compute the normal vector to a polygon, one uses an area formula, which we have proven in a homework using 
Green's theorem (see below). Normal vectors play an important role because they are needed to compute 
the reflection of rays. Just look inside a book or article on 3D graphics and many now familiar objects will pop up. 

EXAMPLE 1). The method of computing normals to a surface by just looking at three points is error prone. 
It is often better to consider a polygon Pi = (x{, yi, Zi) on the surface. What is the normal to such a polygon? 
Note that the points Pi are not necessarily on a plane. Here is what people do in 3D graphics: consider the xy 
projection of the polygon. This gives a polygon (xi,yi) in the plane. Now compute the area of that projected 
polygon with the formula 1/2 ^2 k {xk+i + Xk){yk+i — Dk) (see page 275 in your book). This formula was proven 
using Green's theorem but can be seen geometrically. This area will be the third component c of the normal 
vector. Analoguously, compute the other components. 



The normal vector to a not necessarily planar polygon Pi = 


(xi,yi,Zi) in space is defined as 




V 2 J2k(Vk+l + Vk)(z k +i - z k ) 




n = 


V 2 J2 k ( z k+i + z k )(x k +i - x k ) 






. l/2J2 k ( x k+i+x k )(y k+1 -y k ) _ 





EXAMPLE 2) Snells law of refraction is the problem to determine the fastest path between two points, if 
the path crosses a border of two media and the media have different indices of refraction. This is a Lagrange 
multiplier problem: 

A light ray travles from A =( — 1,1) to the point B = (1, — 1) crossing a boundary between two media (air and 
water). In air (y^O) the speed of the ray is v± = 1 (in units of speed of light). In the second medium (yjO) 
the speed of light is V2 = 0.9. The light ray travels on a straight line from A to a point P = (x, 0) on the 
boundary and on a straight line from P to B. Verify Snell's law of refraction sin(#i)/ sin(#2) = v\/v2- where 9\ 
is the angle the ray makes in air with the y axes and where 62 is the angle, the ray makes in water with the y axes. 

Solution: Minimize 

F(x, y) = \A-i- x) 2 + y 2 /vl + - x) 2 + y 2 /v2 = h/vl + l 2 /v2 

under the constraint G(x,y) = y = 0. The Lagrange equations show that F x (x,y) = 0. This is already Snells 
law F x = v 1 2(x + l)/{2l 1 )+v 2 2(l - x)/{2l 2 ) = means vi/v 2 = sin(0i)/sin(0 2 )- 



http://www.povray.org official povray site 
http://www.povworld.org collection of objects 



OTHER RAYTRACING PROGRAMS OR MODELERS. Note that Povray provides the source code to the 


software in contrast to commercial software which can be very expensive. 


http:/ /www. blender.nl 


Blender (free 3D modeling) 


http:/ /www. aliaswavefront.com 


Alias Wave Front (Maya) 


http:/ /www. light wave, com 


Lightwave 3D 


http:/ /www. corel. com 


Bryce 


http:/ /www. curiouslabs.com 


Poser (3D figure design) 


http:/ /www. eovia.com 


Carrara (Ray Dream studio) 


http:/ /www. art ifice.com 


Radiance (architecture) 


http:/ /www. 3dlinks.com 


Links 
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MATHEMATICA 



Math21a, O. Knill 



VECTORS. Vectors v = (x,y,z), the dot product 
v ■ w or the vector product v x w. 



PROJECTION of v onto w is w(v ■ w)/\w\ 2 . 
SCALAR PROJECTION: length of projection. 



l[v_]:=Sqrt[v.v]; pro j [v_, w_] : =w (v.w) / (w.w) ; 
scalarpro j [v_, w_] : =1 [pro j [v, w] ] ; 
v0={3,4,5}; w0={4,5,l}; scalarpro j [vO , wO] 



DISTANCES. 
d(P,Q) = \P-Q 
d{P,Q + tv) = \{P-Q) x v)\/\v\ 
d(P, Q + tv + sw) 
d(P + tv,Q + sw) 



\(P-Q)-(vXw)\ 

\vxw\ 
\(P-Q)-(vXw)\ 



\vxw\ 



PP [P_, Q_] : =Sqrt [ (P-Q) . (P-Q) ] ; 

PL [P_, {Q_, v_} ] : =Module [ {n=Cross [P-Q, v] /l [v] } , 1 [n] ] ; 

PS [P_, {Q_, v_, w_} ] : =Module [ {n=Cross [v, w] } , (P-Q) . n/1 [n] ] ; 

LL[{P_,v_}, {Q_,w_}] :=Module[{n=Cross[v,w] },m=l[ (P-Q) .n]/l[n] ; 



PLANE CURVES. r(t) = (x(t),y(t)). 
SPACE CURVES. r(t) = (x(t),y(t),z(t)). 

ParametricPlot[{Cos[t] Cos [17t] , Sin [t] Cos [17t] } , {t, 0, 2Pi} ] 
ParametricPlot3D[{t Cos[10t],t Sin [lOt] , t} , {t, -2Pi, 2Pi} ] 





SURFACES. X(u,v) = (x(u,v),y(u,v), z(u,v)). 
GRAPHS. Graph of map z = f(x, y). 

ParametricPlot3D[{v Cos[u],v Sin[u],5 v A 3},{u,0,2 Pi) , {v, -5, 10} ] 
S=Plot3D[Sin[x*y] , {x,-2,2}, {y,-2,2}] 





INTEGRAL. Antiderivative J f(x) dx and definite integral J a f(x) dx. 

Integrate [Cos [Sqrt [x] ] , x] NIntegrate [Cos [Sqrt [x] ] /x A 2, {x, 1, Infinity } ] 



LENGTH OF CURVE. £ y?x + y 2 + z 2 dt. 
Explicit solutions with "Integrate". 



DOUBLE INTEGRAL. £ £ f(x, y) dxdy. 



Integrate! Sin[x] x A 2+y A 3, {x, 0, 2 Pi}, {y, 0,3 Pi} ] 



TRIPLE INTEGRAL, ff f(x,y, Z) dxdydz. integrate [ Sin[x] x A 2+y A 3+z A 4, {x,0,2 Pi}, [y, 0,3 Pi},{z,0,Pi} 



LINE INTEGRAL. /* F(r(t)) ■ r'(t) dt. 



FLUX INTEGRAL. 

f b a ^F(u,v) ■ (X u x X v )(u,v) dvdu. 



X[u_,v_] :={Cos[u] ,Sin[u] ,v}; F [ [x_, y_, z_} ] :={x A 2,y,z}; 
Xu[u_,v_] :=D[X[s,v],s] /. s->u; Xv [u_, v_] : =D [X [u, t ] , t ] /. 
R [u_, v_] : =Cross [Xu [u, v] , Xv [u, v] ] ; 
Integrate [F[X[u,v] ] .R[u,v] , {u,0,2Pi}, {v,0,l}] 



LAGRANGE MULTIPLIERS. Extremize F{x,y,z) under con- 
straint G(x, y, z) = c. Solve system VF(x, y, z) = 
X\JG{x, y, z), G(x, y,z) = c for the x, y, z, A. 



F[x_,y_, z_] :=-x A 2-y A 2-z A 2; 
G[x_,y_,z_] :=x+y+z; 

Solve [{D[F[x,y,z] ,x]== L*D[G[x,y,z] ,x] , 
D[F[x,y,z],y]== L*D [G [x, y, z] , y] , 
D[F[x,y,z] ,z]== L*D[G[x,y,z] ,z] , 
G[x,y,z]==l}, {x,y,z,L}] 



GRAD CURL DIV. 

gmd(f) = (f x J y J z ) 

rurl(P, Q, R) = (R y -Q Z ,P Z - R x , Q x - P y ) 
div{P,Q,R) = P x + Q y + R z . 


f [x_,y_,z_] :=x A 2 
grad[U ] :={D[U[x 
curl[F ] :={D[F[x 
D[F[x 
D[F[x 

div[F ] :=D[F[x,y 
+D[F[x,y 
+D[F[x,y 


y z; F[x_,y_, z_] :={-y,x, z A 2 x}; 
y,z] ,x] ,D[U[x,y,z] ,y] , D [U [x, y, z] , z] } ; 
y,z] [[3]],y]-D[F[x,y,z] [ [2] ] , z] , 
y,z] [[l]],z]-D[F[x,y,z] [[l]],x], 
y,z] [[2]],x]-D[F[x,y,z] [[3]],y]}; 
z] [[l]],x] 
z] [[2]],y] 
z] [[3]],z]; 





LINEAR APPROXIMATION. 

L(x,y,z) = f(x,y,z)+gr&d(f)(x ,yo,z )-(x-xo,y- 
yo,z- zq) 



f [x_, 



: A 2*y; 



Dx[f_] [x_,y_,z_] :=] 
Dy[f_] [x_,y_,z_] :=D[f[u,v,w],v] /. {u->x, v->y , w->z } 
Dz[f_] [x_,y_,z_] :=D[f[u,v,w],w] /. {u->x, v->y , w->z } 
grad[f_] [x_,y_,z_] : = {Dx[f] [x, y, z] , Dy [f ] [x, y, z] , Dz [f ] [x,y,z 
L[x_,y_, z_] :=f [l,l,l]+grad[f] [1,1,1] . [x-l,y-l, z-1} 



QUADRATIC APPROXIMATION. 

Q(x,y) = L(x,y) + [f"(x ,y )(x 
xo, y - y )] -(x-x ,y- y )/2. 



f [x_, y_] : =Sin [x*y] -x A 2*y ; 

Dx[f_] [x_,y_]:=D[f[u,v],u] /. {u->x,v->y} 
Dy[f_] [x_,y_]:=D[f[u,v],v] /. {u->x,v->y} 
grad[f_] [x_,y_] :={Dx[f] [x, y, z] , Dy [f ] [x,y,z]}; 
Dxx[f_][x_,y_]:=D[f[u,v],{u,2}] /. {u->x,v->y} 
Dxy[f_] [x_,y_] :=D[D[f [u,v],u],v] /. {u->x,v->y} 
Dyy[f_][x_,y_]:=D[f[u,v],{v,2}] /. {u->x,v->y} 

hess[f_] [x_,y_] :={{Dxx[f] [x,y],Dxy[f] [x, y] } , {Dxy [f ] [x,y],Dyy[f] [x,y]}} 
Q[x_,y_] :=f [l,l]+grad[f] [1 , 1] . {x-1 , y-1 }+ (hess [f ] [1 , 1] . {x-1 , y-1 } ) .{x-l,y-l}/2 



DSolve[D[p[x] ,x]==p[x]* (l-p[x] ) ,p[x] ,x] 



ODE's. j/[t] = /(p,t) 
INITIAL VALUE PROBLEM, x" = x - fix', x[0] = a, ^[0] = b 




VECTOR FIELD. F(x,y) = (P(x,y),Q(x,y)) 



Needs [ "Graphics ^PlotField 1 " ] 
PlotVectorField[[-y,2 x}, {x,-l,l}, {y,-l,l}] 
PlotGradientField[x A 2 y, {x,-l,l}, {y,-l,l}] 

Needs [ "Graphics ^PlotFieldSD 1 " ] 

PlotVectorField3D[{-y,2 x,z}, {x,-l,l}, {y,-l,l}, {z,-l,l}] 
PlotGradientField3D[x A 2 y z, {x, -1, 1 } , {y, -1, 1 } , { z, -1, 1 } ] 





ODE's in space x = F(x). The example shows the Rossler attractor. 






s=NDSolve[[x' [t]==-(y[t]+z[t]),y' [t]==x[t] +0.2*y[t],z' [t] == . 2+x [t] *z [t] -5 . 7*z [t] , 
x[0]==.3,z[0]==4.2,y[0]==1.3}, {x,y,z}, { t , , 400 } , MaxSteps->12000 ] ; 
S=ParametricPlot3D [Evaluate [[x[t] ,y[t] ,z[t] }/.s] , {t, 0,400}] 













HEAT EQUATION u t = fiu X: 



S=NDSolve[[D[u[x,t],t]==D[u[x,t], {x,2}], 

u[x,0]==Sin[5*Pi*x] ,u[0,t]==0, u [1 , t ] ==0 } , u, {x, , 1 } , { t , , . 01 } ] ; 
Plot3D [Evaluate [u[x,t]/. s [ [ 1 ] ] ] , [t , , . 01 } , [x, , 1 } ] 



WAVE EQUATION u tt - 



s=NDSolve[{D[u[x,t],{t,2}] == D [u [x, t ] , {x, 2 } ] , 

u[x,0] == Sin[4*Pi*x] , Derivative [0, 1] [u] [x, 0] = 
u[0,t] ==u[l,t]}, u, {x,0,l}, [t,0,l}]; 

Plot3D [Evaluate [u[x,t]/. s [ [1] ] ] , [t , 0, 1 } , [x, 0, 1 } ] 
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ORDINARY DIFFERENTIAL EQUATIONS 



Math21a, O. Knill 



Differential equations are laws which determine how a system evolves in time. An example of a system is our 
solar system. The motion of the planets is described by Newton equations, an ordinary differential equation 
(ODE). An other example is the electromagnetic field for a fixed charge and current distribution. The motion 
of this system is described by Maxwell equations. In vacuum, they reduce to the wave equation, describing 
electromagnetic waves like light. The Maxwell equations or the wave equations are examples of partial differential 
equations (PDE's). 

ODE. An ordinary differential equation (ODE) is an equation relating derivatives f'{t), f"{t), ... of a function 
/ or a curve t ^ f(t). Only derivative with respect to one variable can appear. Otherwise, the equation 
is called a partial differential equation (PDE). If derivatives up to the order n appear, the equation is 
called a differential equation of order n. If f(t) is a vector in R d , then the ODE is in d dimensions: 



Equation 


ODE? 


/"(*)+ cos(/(*)sin(t)) =0 


yes 


fx(x,y) + f y (x,y) = 


no 


x(t) = s'm(x(t)) 


yes 


(x,y) = (-y,x) 


yes 


curl(F) = A 


no 



Equation 


order 


dim 


f"(x) = f(x) 


2 


1 


sin(/"'0z)) = f(x) 


3 


1 


(f>(x),g'(x),h'(x)) = F(f(x),g(x),h(x)) 


1 


3 


(x,y) = (-y,x) 


2 


2 


(x) 5 = X 


1 


1 



While for usual equations like x 2 + 2x + 1 = 0, the unknown x is a number, for a differential equation like 
x = x it is a function x(t) respectively a curve t ^ x(t). Think always of t as time. 

SOLVING AN ODE. Solving an ODE means to find all functions or curves which satisfy the equation. Examples. 

f'(x) = f(x) has solutions e x or 5e x and f"{x) = —f(x) has solutions cos(x) or 4sin(x) and (x'(t), y'{t)) = 
(2,x(t)) has solutions (2t,t 2 ). 

FACT. Every ODE can be written as d/dtx(t) = F(x(t)), where F is a vector field and if is a vector: 
PROOF. Solve F(/, f, / (n) ) = for = G(f, f, f" , . . . , f^' 1 ^ and introduce the vector 

x = (x u x 2 , ■ ■ ■ , O = (/, /', /" , • • • , 7 (n - 1} ). Then x' = (x 2 , x 3 , . . . , G(x u . . . , i„_i)) = F(x). 

Example: f"{t) - sin(/"(t)) + f(t) = 0. Define x = (x(t),y(t),z(t)) = (/"(*), /(*))- Then d/dtx = 
(/"'(*), /"(*), /'CO) = (Mf"(t)) - /CO, /"(*), /'CO) = (sin(j/) - x, y, x) = F(x). By introducing t as a separate 
variable, one can have F time-independent. F is then called autonomous. By increasing the dimension, an 
ODE can be written as a first order autonomous system x = F(x). 



EXAMPLES OF ODE's. 



x(t) = ax{t). 


Population models for a > 0, radioactive decay for a < 


x = ax(l — x). 


Logistic equation: population model 


x = —ax. 


Harmonic oscillator: describes periodic oscillations 


m 3 X 3 =Y.i^™i/\Xi-Xj\ 2 


Newton equations. Example: Xj positions of planets of mass rrij. 



SOLVING DIFFERENTIAL EQUATIONS. The different approaches are illustrated with x = x. 



1) Integration. To find explicit solutions of dx/dt = x we write dx/x = dt or J ^dx = t + c. Integration 
on both sides gives log(x) = t + c or x(t) = e t e c = Ce l . 

2) Expansion. For example by Taylor expansion: if x(t) = ciq + ait + a^t 1 + then x(t) = a± + 2a 2 t + 
3ast 2 + .... Comparing coefficients gives a± = ao,a 2 = ai/2 = ao/2,as = a 2 /3 = ao/6 etc. so that 
x (t) =a (l+t/2 + t 2 /3! + ...). 

3) Approximation. Simplest numerical method is the linear approximation x(t + dt) = x(t) + dtx(t) = 
x(t) + dtx(t) = (1 + dt)x(t) and x(t + kdt) = (1 + dt) k x{t). If dt = t/n and k = n, then x(0 + t) = 
x(0 + nt/n) = (1 + t/n) n x{0). 

In the example, all three methods led to the correct solution x(t) = Ce*. In 1), C was an integration constant, in 
2), the zeroth Taylor coefficient ao, in 3), C = x(0). The explicit expression e* came in 1) from anti-derivation, 
in 2) from (1 + 1/2 + 1 2 /3! + ...) = e*, in 3) from lim n _ >00 (l + t/n) n = e t . For most ODE's, there are no closed 
form solutions. One can work however with truncated Taylor series or approximating polygons x(dt), x(2dt), . . . 
for the path x{t). 



EXISTENCE OF SOLUTIONS of x = F(x). Given x(0), there exists a solution x(t) for some time t G [0,a] if 
F is smooth. Solutions don't always exist for all times: x = x 2 can be solved by integrating dx/x 2 = dt so that 
-1/x = t + c or x(t) = -1/0 + c). From x(0) = -1/c we get c = -l/x(0) and x{t) = -l/(t - l/x(Q)). The 
solutions exists for t < l/x(0). 



THE EQUATION x = ax. For a > 0, it models a explosive chain-reaction or population growth (the number of 
newborn people is proportional to the number of people which exist already). For a < 0, x = ax has solutions 
which decay (example is radioactive decay: as more atoms there are, as more atoms decay). The explicit solution 
is x(t) = x(0)e at . For a = 0, there are only solutions x which are constant in time. 



LINEAR EQUATIONS. A ODE is linear if it can be written as x = Ax, where A is a matrix. It can be solved 
by x (t) = e At x(0), where the matrix e B is defined as e B = 1 + B + B 2 /2\ + B 3 /3\... and for example B 3 = BBB 
is obtained by matrix multiplication. To understand the solutions, it is helpful to choose coordinates in which 
A is simple, for example diagonal. (See Math21b). 



Because A 2 = -I, A 3 = 



EXAMPLE. The ODE x = —x is with y = x equivalant to (x, y) = (y, —x) and can be written as 
^ [ y] = [ -1 ] [ y } = A [ y 

At _\ 1 - t 2 /2! + t 4 /3! - ... t — t 3 /3! + t 5 /5! — ... 1 _ [ cos(t) sin(t) 
6 ~ [ -t + t 3 /3! - £ 5 /5! + ... 1 - t 2 /2\ + t 4 /3\ - ... \ ~ [ - sin(t) cos(t) 
x(t) = cos(t)x(0) + sin(t)y(0), y(t) = — sin(t)x(0) + cos(t)y(0). Indeed, the circular curves r{t) = (x(t),y(t)) 
solve the differential equation. 



-A, A A = 1,A 5 = A etc. we have 
leading to an explicit solution 



SEPARATION OF VARIABLES. A system x = g(x,t) (equivalently x = g(x,t), i = 1) can sometimes be solved 
by separation of variables: dx = f(x)g(t)dt or dx/f(x) = g(t)dt can be integrated: 

• If x = g(t), then x{t) = f Q g{t) dt. 

• If x = h(x), then dx/h(x) = dt and so t = f£ dx/h{x) = H{x) so that x(t) = i? _1 (t). 

• lfx = g(t)/h(x), then H(x) = f* h(x) dx = /* g(t) dt = G(t) so that x(t) = H~ 1 (G(t)). 

In general, there are no closed form solutions (i.e. x = e - * 2 has a solution x(t) = J * e - * 2 dt which can 
not be expressed by exp,sin, log, y 77 , etc.. The anti-derivative of e~* 2 = 1 — t 2 + t 4 /2! — t 6 /3! + . . . is x(t) = 
t - t 3 /3 + £ 4 /(3 • 2!) - t 7 /{7 • 3!) + . . .). 



2D SYSTEMS. A two dimensional ODE is of the form x = 
P(x,y),y = Q(x,y). If r{t) = (x(t),y(t)) is the solution curve, 
then the ODE tells that the velocity vector r'(t) is equal to 
F{r{t)). Often a good understanding is achieved by drawing vec- 
tor field and some solution curves. 

Examples 2) and 3) below are 2D systems. Example 1) shows a 
system in three dimensions and also the timedependent system 4) 
became a 3D system when adding t as the z variable. 



1) LORENTZ SYSTEM. 

x = 10 (y — x) 
y = —xz + 28a; — y 
8z 




2) VAN DER POL EQUATION. 

x + (x 2 — l)x + x = or 

( x 3 

x = y-{—-x) 
y = -x 




3) VOLTERRA-LODKA SYS- 
TEMS. 



4) THE DUFFING SYSTEM. 

x + x/10 - x + x 3 - 12 cos(t) = 



PARTIAL DIFFERENTIAL EQUATIONS I 



Math21a, O. Knill 



PDE. If an equation for a function contains derivatives with respect to different variables, it is called PDE. We 
will see examples of PDES, see how they are derived from physics and how one can solve them. 



EXAMPLES of partial differential equations: 

• Ut = u x advection equation, models transport 

• Uu = Aw = u xx + u yy wave equation in 2D. 

• u t = ku xx heat equation in ID 

• u t = kAu = k(u xx + u yy ) heat equation in 2D. 

• uu = Aw — m 2 u Klein-Gordon equation, ap- 
pears in quantum theory. 

• Aw = u xx + Uyy + u zz = Laplace equation in 
3D. 

• grad(f/) = F search of potential U for vector 
field F. (Need curl(F) = 0). 

• div(£) = 0, div(E) = 0, B = -c curl(E), E = 
c curl (5). Maxwell equations in vacuum. 

• curl(A) = F search of vector potential for vec- 
tor field F. (Need div(F) = 0). 



• Aw = 47rp Poisson equation, p charge, E = 
grad(w) is electric field. 

• Ut + u xxx + 6uu x = KdV equation has soliton 
solutions famous nonlinear waves. 

• ih u — Ti 2 Au/2m + Vu Schrodinger equation 

in quantum mechanics. 

• Uu — u xx + sin(w) = Sine-Gordon equation 

has "kink- ant ikink" solutions. 

• w + w • Vw = uAu — Vp + /, div(w) = Navier 
Stokes equations. Win 1 Mio prove existence! 

• w = uu x Burger equation, models shocks, 
breaking waves. 

• w = (u x ) 2 /2 + V(x) Hamilton- Jacobi equation. 
Classical mechanics. 

• det(/") = p Monge- Ampere equation, /" 

Hessian of / = f(x, y, z). Appears in geometry. 



DERIVATION OF HEAT EQUATION. Conservation of mass tells that the flux of a fluid with density p and 
speed j through a surface S bounding a region G is equal to the change of mass in the region J J s j-dS = 
J J J G p dV . By Gauss theorem, J J s j-dS = J J J G div(j) dV so that J J J G (div(j) — p) dV = 0. Because the 
spherical region is arbitrary, we have / = div(j) — p = (if the function / were not zero, we could chose a region 
G contained in / > 0). The density and the current are therefore linked by the continuity equation p = div(j). 
If particles move randomly, they tend to move more likely from places with high density p to places with low 
density. This means j = Kgrad(p). Plugging this into the continuity equation gives p = «div(grad)(p) = K,Ap. 
This is the diffusion equation, also called heat equation. 



DERIVATION OF THE WAVE EQUATION. The identity AB = grad(div(£)) - curl (curl (B)) gives together 
with div(,B) = and curl(B) = E/c the relation AB = —d/dtcurl(E)/c which leads with B = —c curl(.E') to 
the wave equation AB = d 2 /dt 2 B/c 2 . This equation B tt = c 2 AB (as well the equation E u = c 2 AB derived 
in the same way) describes the propagation of electric and magnetic fields in vacuum. 



DERIVATION OF THE TRANSPORT EQUATION. Start again with the continuity equation w = div(j) = j x 
in one dimensions. If the current j is proportional to u x we get u t = au x . 



DERIVATION OF BURGER'S EQUATION. If j = (w 2 ) x , this means that the current j depends nonlinearly 
on w, becoming large for large w. The corresponding equation u t = 2uu x is a model for waves near the beach. 



SUPERPOSITION. A PDE is called linear if it is linear in each variable, as well as in w, u xi u t etc. The heat, 
Laplace, wave, advection or Maxwell equations in vacuum are linear. The Burger equation, Sin-Gordon or 
KdE equation are examples of nonlinear equations. 

If w and v are solutions of a linear equation, then Aw and the superposition w + v are solutions too. 



THE ADVECTION EQUATION. A general solution of u t = cu x - ru is 

u(t,x) = e~ rt f(x + ct) , 

here f(x) = w(0, x). This formula provides an explicit solution of a PDE. The initial function w(0, x) determines 
u(t,x) at later time in an explicit way. 



DIFFUSION EQUATION. A special solution to the diffusion equation u t = pu xx — ru is 

u{x,t)=a^=e- rt e- x 'l^ . 
\JAi\pt 

(Check ut/u = — r + (x 2 — 2pt)/(4pt 2 ), u xx /u = (x 2 — 2pt)/(4p 2 t 2 )). Also translations u(x — s,t) are solutions. 
By superposition, sums of such functions are solutions and by passing to limits of Riemann sums, also integrals 

u(x,t) = —L= [°° a{s)e- rt e-^ 2 ^ 4 ^ ds 

is a solution. By taking the limit t — > 0, we see that a(s) = w(0, s). We have there- 
fore an explicit formula for the solution of the diffusion equation. The initial heat distribu- 
tion determines the distribution at time t. It is considered an explicit formula. It can 
be generalized to arbitrary dimensions. The three pictures show an evolution in the plane. 



WAVE EQUATION. Write u tt - u xx = as {d t - d x ){d t + d x )u or (d t + d x )(d t - d x )u. You see that 
if u satisfies the transport equation (dt + d x )u = or (dt — d x )u = 0, then u solves the wave equa- 
tion. Solutions are f(x — t) or g(x + t). Now, if u(0,x) = F(x) and u x (u,x) = G(x) are known and 
u(t,x) = f(x — t) + g(x + £), then F(x) = w(0, x) = f(x) + g(x) and G(x) = u t (Q,x) = —f'(x) + g'(x) 
determine / and g: from F = f + g,F' = f + g',G = g' — f we get /' = (F f — G)/2 and 
g' = (F' + G)/2 and so /, g. The pictures show an evolution of the wave equation in two dimensions. 



4 *f 4<c>f 4C» 



SOLUTIONS BY SEPARATIONS OF VARIABLE. Some linear equations in one dimension can be solved by 
starting with an "Ansatz" u(t,x) = T(t)X(x). 

TRANSPORT EQUATION. u t = TX 7 u x = TX'. Because f/T = X/X does not depend on x (look at LHS) 
and not depend on t (look at RHS), it is a constant C and we have T(t) = e Ct and X(x) = e Cx and therefore 
w(£, x) = T(t)X(x) = e Ct e Cx = e c ( t+x \ By summing up such solutions one build any solution f(t + x). 
DIFFUSION EQUATION. u t = TX, u xx = TX". Again, f/T = X" / X is constant C = —k 2 so that 
T(t) = e~ k * and X(x) = e lkx and u(t,x) = e~ k t e lkx . (If X is a bounded function, then C can not be positive 
so that we can write C = — k 2 .) A general solution is a superposition u(t,x) = J a(k)e~ k t e lkx dx of such 
solutions. One can determine a(k) from the initial condition w(0,x) = J a(k)e lkx dx using Fourier theory. 
WAVE EQUATION. Also the wave equation can be treated like that. We get f/T = X" /X = C. Again 
C = —k 2 and X(x) = e ±ikx ,T(t) = e ±ikt so that u(t,x) = e ±ikx e ±lkt is a solution. We see that a(k)e lk ^ x -^ 
and b(k) = e %k ^ x+t ' are solutions. Superposition leads to arbitrary solutions u(t, x) = f(x — t) + g(x + t). 

BEYOND PDE's. A computer which solves ODE's or PDE's numerically discretizes the system. u t is replaced 
by the difference u n+ \ — u n , u t t by u n+ i — 2u n — u n -\. The equation x = ax for example is discretizes as 
x n+ \ —x n = ax n or x n+ \ = (1 + a)x n . Instead of a curve x(t), the solution is a sequence x n . Time is discrete. 
Also PDE's can be discretized. The diffusion equation becomes w n+ i )7n — w njm = w n , m+ i — 2w n , m + w n , m _i 
which is w n +i )m = w nim +i — w n;Tn + w n)m _i. This is also called a coupled map lattice. In each time step, a 
lattice configuration {u n ,m}m=-oo 1S mapped into a new configuration. If also the values of w n)Tn are discretizes, 
one gets a cellular automaton. Both can be used to simulate partial differential equations numerically. 
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PARTIAL DIFFERENTIAL EQUATIONS II 



Math 21a, O. Knill 



A great deal of my work is just playing with equations and seeing what they give. I don't suppose that applies so 
much to other physicists; I think it's a peculiarity of myself that I like to play about with equations, just looking 
for beautiful mathematical relations which maybe don't have any physical meaning at all. Sometimes they do. 
Paul A. M. Dirac 



LAPLACE EQUATION. In the plane, the equation is Aw = divgradw = u xx + u yy = 0. Solutions are called 
harmonic. You have found such solutions already in an earlier homework assignment. 



Here is a general way to produce solutions: if z = x + iy 
is a complex number, then the real or imaginary part of 
(x + iy) n = r n e in6 are solutions of the Laplace equation. For 
example, u(x,y) = Re((x + iy) 3 ) = x 3 — 3xy 2 = r 3 cos(3#) or 
u(x, y) = Im((x + iy) 3 ) = 3x 2 y — y 3 = r 3 sin(30) solve the Laplace 
equation. 




V" 

In one dimension, only linear functions u(x) — ax + b are harmonic: just solve u xx = 0. In three dimensions, 
harmonic functions are constructed in spherical coordinates u(r,4>,6) = Re(r l Pf 1 '(cos(9))e' im ^) with special 
polynomials Pi. The angular portions of such functions are called spherical harmonics and denoted by Yj m . 
The pictures show three examples. These functions are important for quantum mechanics because there are 
functions R n i of r such that u n} i }Tn (r,0,<ft) = R n i(r)Yi m ((f),6) define stationary electron distributions in the 
hydrogen atom. Knowing about these functions allows to understand of the hydrogen atom and makes the 
construction of the periodic system of elements transparent. 




You can play with spherical harmonics in a Java applet on http : 
/ /www. set. gu.edu.au/research/laserP/livejava/spher Jiarm.html Harmonic functions have the prop- 
erty that they are stationary solutions to the heat equation u t = pAu or the wave equation u u = c 2 Au or the 
Schrodinger equation of a free particle. 
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POISSON EQUATION. A generalization of the Laplace equation is the Poisson equation Au = p, where p is 
a scalar function. If p is a charge distribution, then u is the electric potential: The vector field E = grad(w) 
satisfies the Maxwell equation div(E) = p. The trick to solve this Maxwell equation is to solve the Poisson 
equation. There are solution formulas using Green's functions ga, depending on the dimension d: 



ID 


u{x) = 1/2 J B p(s)\x — s\ ds. 


9i*P 


2D 


u ( x ) = I Jr2 p(s) log \x - s\ dA(s) 


92 *p 


3D 


u ( x ) = t£J I /na p(s)/\x - s\ dV{s) 


93* P 



On the right, the notation / ★ b = J f(x — y)g(y) dy is used. * is a "multiplication" of functions called 
convolution. Fourier theory allow to prove these formulas with no effort: the Poisson equation Aw = p 
becomes after Fourier transform Ym=\ x 1 u = P so ^ na ^ u = 9dP, where ga = 1/ Yli=i x i- Reversing the Fourier 
transform gives u = gd*P (there is a general rule fg — f*g. Because the Fourier transform of 1/x 2 is gi{x) = \x\, 
the Fourier transform of l/(x 2 + y 2 ) is #2 (a?) = ^ log \x 2 + y 2 \ and the Fourier transform of l/(x 2 + y 2 + z 2 ) is 

(x 2 + y 2 + z 2 ), the solution formulas become obvious. 
UNIQUENESS OF SOLUTIONS? Note that if you take a solution u of the Poisson equation and add a harmonic 
function u satisfying Aw = 0, you get an other solution. 

ABOUT THE GREEN FUNCTIONS. The Green functions g d are the natural Newton potentials in R d . The 
one dimensional potential is studied because two infinite planes attract each other with force \x\. The n-body 
problem with such potentials attractive (gravitational) or repelling (electric) is studied a lot. 



EXISTENCE OF SOLUTIONS. Having explicit solutions to partial differential equations is rather exceptional. 
In general one has to work hard to establish existence of solutions. Existence can be subtle. Even for one 
of the simplest nonlinear PDE's grad(w) = F, there are not necessarily solutions. Only if curl(F) = 0, then 
a potential u can exist. For nonlinear PDE's, existence can be very hard to prove. One million dollars have 
been written out by the Clay institute for a proof in the case of the Navier Stokes equation. Anyhow, as the 
following story shows, mathematicians are crazy about existence: 




An engineer, a chemist and a mathematician are staying in three adjoining cabins at an old motel. First 
the engineer's coffee maker catches fire. He smells the smoke, wakes up, unplugs the coffee maker, throws it 
out the window, and goes back to sleep. Later that night the chemist smells smoke too. She wakes up and 
sees that a cigarette butt has set the trash can on fire. She says to herself, "Hmm. How does one put out 
a fire? One can reduce the temperature of the fuel below the flash point, isolate the burning material from 
oxygen, or both. This could be accomplished by applying water." So she picks up the trash can, puts it in 
the shower stall, turns on the water, and, when the fire is out, goes back to sleep. The mathematician, of 
course, has been watching all this out the window. So later, when he finds that his pipe ashes have set the 
bed-sheet on fire, he is not in the least taken aback. He says: 

"Aha! A solution exists!" 
and goes back to sleep ... 







PDE's IN MULTIVARIABLE CALCULUS. Where did PDE's matter in multivariable calculus? 



• when doing partial derivatives, we checked that functions are solutions of PDE's. Examples were the 
continuity equation, the Laplace equation, the wave equation, the Burger equation. 

• we have seen the meaning of the heat equation T t = AT using the divergence theorem. 

• The Laplace equation Af = defines harmonic functions. 

• We have seen relations between certain PDE's and the topology of the space: 
curl(F) = implied F = grad(/) if simply connected 

div(F) = implied F = curl(G) if every sphere can be collapsed to a point 
grad(f) = implied / = const if region is connected 

• the Maxwell equations illustrate the divergence theorem as well as Stokes theorem. 

• we have seen the Poisson equation div(F) = p describing gravity as well as electrostatics. 
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